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(57) Abstract 

The present invention provides crystallized pro- 
tective protein/cathepsin A (PPCA), a precursor thereof 
(pPPCA) or at least one subdomain thereof; methods 
for x-ray diffraction analysts to provide x-ray diffrac- 
tion patterns of sufficiently high resolution for three- 
dimensional structure determination of the protein, as 
well as methods for rational drug design (RDD), based 
on using amino acid sequence data and/or x-ray crys- 
tallography data provided on computer readable media, 
as analyzed on a computer system having suitable com- 
puter algorithms. 
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Protective Protein/Cathepsin A and Precursor: Crystallization, X-Ray Diffraction, Three- 
Dimensional Structure Determination and Rational Drug Design 



Background of the Invention 

5 Statement as to Rights to Inventions Made Under 
Federally-Sponsored Research and Development 

Pan of the work performed during development of this invention utilized U.S. Government funds. The U.S. 

Government has certain rights in this invention. 

Field of the Invention 

10 The present invention is in the fields of molecular biology, protein purification, protein crystallization, x-ray 

diffraction analysis, three-dimensional structure determination and rational drug design (RDD). The present invention 
provides crystallized protective protein/cathepsin A (PPCA) and its precursor (pPPCA). The crystallized PPCA or 
pPPCA is analyzed by x-ray diffraction techniques. The resulting x-ray diffraction patterns are of sufficiently high 
resolution to be useful for determining the three-dimensional structure of the PPCA or pPPCA protein, and for RDD. 

15 Related Background Art 

The human protective protein/cathepsin A (PPCA, also known as human protective protein or HPP) has been 
identified as the primary genetic defect underlying galactosialidosis (d'Azzo etai, Proc. NatL Acad. Set. U.S.A. 70:4535- 
4539 (1982)), a lysosomal storage disease inherited as an autosomal recessive trait. Patients with this disorder are 
diagnosed as having drastically reduced P-galactosidase and neuraminidase activities in their cell lysosomes. Examples 

20 of lysosomal storage diseases are presented in Table 316-1 of Braunwald et ai, eds., Harrison 's Principles of Internal 
Medicine, 1 1th Ed., pp. 1661-1671. McGraw Hill Book Co., New York (1987); as well as Wenger et ai. Biochem. 
Biophys. Res. Commun. 52:589-595 (1978); Tettamanti etaJ. eds.. Sialydases and Sialidosis. Perspectives in Inherited 
Metabolic Diseases, Vol. 4, Edi. Ermes, Milano (1981), pp. 261-279 and 379-395; and van Diggelen et. al. Lancet 
2:804( 1987), which references are entirely incorporated herein by reference.. 

25 Researchers have proposed that one of PPCA *s functions is to stabilize P-galactosidase and neuraminidase in 

a multi-enzyme complex, which complex is deficient in galactosialidosis patients (d'Azzo et ai (1982;, infra; Hoogeveen 
et ai (1983/ infra). Evidence for this protective function comes from studies showing that PPCA is taken up from the 
culture medium by galactosialidosis fibroblasts and that PPCA restores both P-galactosidase and neuraminidase activities 
to these fibroblasts (d'Azzo et ai (\9S2j, infra). 

30 The cDNA for PPCA directs the synthesis of a 452 amino acid precursor PPCA (pPPCA) (Figure 13) with a 

molecular weight of 54 kDa (Galjart et ai. Cell. 54:755-764 (1988)). The amino acid sequences of PPCA (Figure 14) 
and pPPCA (Figure 13) contain two glycosylation sites (Asn 1 17 and Asn 305), both of which are glycosylated in 
cultured fibroblasts and cells over-expressing PPCA or pPPCA. pPPCA dimerizes soon after synthesis in the 
endoplasmic reticulum (ER) (Zhou et ai, EM BO J. 70:404-4048 (1991)). 

35 Lysosomal PPCA has cathepsin A/deamidase/csterase activities which are exerted in vitro on a specific subset 

of bioactive peptides. Non-limiting examples of those hydrolyzed by PPCA are: substance P and substance P-free acid; 
oxytocin and oxytocin-free acid; neurokinin A; angiotensin I; bradykinin (Jackman infra. (1990). Furthermore, the 
enzyme inactivates endothelin I activity in rat smooth muscle cells and normal human tissues. This activity was deficient 
in liver from a galactosialidosis patient (Itoh, infra. 1995; Jackman et ai. J. Bio!. Chem. 26 7. 2 8 72-2 87 5, (1992). 

40 Endothelins (ET-I. ET-2 and ET-3) are potent vasoconstrictors and elevate blood pressure in mammals. They 

also intluence cell proliferation and hormone production and have been implicated in cardiovascular disorders, ranging 
from hypertension to stroke to ischemic heart disease (Rubanyi and PolokofT. Pharmc. Rev. 46:325-4 15(1 994)). 

The three-dimensional structure of a PPCA or a pPPCA has not previously been published, which structure 
could delineate specific biological activities and ligands as therapeutics for PPCA-related pathologies. Accordingly. 

45 there is a need to provide three-dimensional structures of at least one PPCA, pPPCA or ligands for diagnosis or therapy 
of PPCA-related pathologies. 
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Summary of the Invention 
The present invention provides methods of expressing, purifying and crystallizing a human protective 
proiein/cathepsin A (PPCA) and its precursor, precursor protective protein/cathepsin A (pPPCA). The present invention 
also provides methods for obtaining crystallized PPCA or pPPCA that can be analyzed to obtain x-ray diffraction 
patterns of sufficiently high resolution to be useful for three-dimensional structure determination of the protein. 

The x-ray diffraction patterns can be either analyzed directly to provide the three dimensional structure (if of 
sufficiently by high resolution), or atomic coordinates for the crystallized PPCA or pPPCA, as provided herein can be 
used for structure determination. The x-ray pattern/diffraction patterns obtained by methods of the present invention 
and provided on computer readable media, are used to provide electron density maps. The amino acid sequence is also 
useful for three-dimensional structure determination. The data is then used in combination with phase determination 
(e.g.. using multiple isomorphous replacement (MIR) molecular replacement techniques) to generate electron density 
maps of a PPCA or a pPPCA. using a suitable computer system. 

The electron density maps, provided by analysis of either the x-ray diffraction patterns or working backwards 
from the atomic coordinates, provided herein, are then fitted using suitable computer algorithms to generate secondary 
tertiary and/or quaternary domains of a PPCA or a pPPCA. which domains are then used to provide ar. overall three- 
dimensional structure, as well as expected binding and active sites of the PPCA or pPPCA. pPPCA has some of the 
active and binding sites of PPCA . except for changes in structure due to the presence of the portion of the pPPCA which 
is deleted during maturation to PPCA (e.g.. residues 285-298 of Figure 13). 

Structure determination methods and computer systems are also provided by the present invention for rational 
drug design (R.DD). These RDD methods use computer modeling programs to find potential ligands that are calculated 
to assoaate wuh. or bind to. sites or domains of a PPCA or a pPPCA. Potential ligand, are then screened for modulating 
or binding activity. Such screening methods can be selected from assays for at least one PPCA-specific structural feature 
or biologtcal activity, preferably as associated with a PPCA- or pPPCA-related pathology, e.g.. protect™ activity (eg. 
modulation of (J-galactosidase activity and neuraminidase (NA) activity); and peptide or enzyme modulating activity 
(eg., of endothelin I (serine carboxypeptidase), neuropeptides, cathepsin A. and the like), according to known assays 
The resulting ligands provided by methods of the present invention are synthesized and are useful for treating, inhibiting 
or preventing at least one of PPCA related pathology in a mammal. 

Other objects of the invention will be apparent to one of ordinary skill in the art from the following detailed 
description and examples relating to the present invention. 

Brief Description of the Figures 
Figure I: is a schematic ribbon diagram of the PPCA monomer (monomer 1), where Secondary structure 
ass.gnments are according to DSSP (Kabsch and Sander, Biopolymers 22:2577-2637 (1983)). The -core' domain is 
shown in yellow. The 'cap' domain consists of a 'helical* subdomain, in red, and a -maturation' subdomain in orange 
The catalytic triad Ser 150. His 429 and Asp 372 (from right to left) is shown by small green spheres. (Figure generated 
S3 using MOLSCRIPT (Kraulis, J. Appl. Cryst. 24:946-950 (1991))). 

Figure 2 is stereo diagram is presented of the C". trace of the PPCA monomer 1 with numbering of selected 
residues. The residues forming the o-helices and P-strands are as follows according to DSSP: 

Core domain: cpi (21-27); cp2(32-39); Cp3(50-54): Cal(63-67) CP4(73-75); Cp5(82-84)- CB6(94-98V 
Ca2(.,8-,35):Cp7(,44-.49);Ca3(152.,63^^ 

359); CP9(363-369): Ca8(377-386); Cpi0(39l -401 ); cpi 1(407-416): CP 1 2(4 1 9-424): C«i9(43 1-434); C„ 10(436-447)- 
Capdomain: Hb1(I83-I96); H«2(202-212): Ho t 3(226-240): Mpi(26l-264); Mp 2 (267-270): M*l(290-293)' 
MP3(296-299). Note that for monomer 2 the secondary structure assignments in the cap domain are slightly differem 
than in monomer 1. Residues in H0I are in aregion of poor density and Mai is an extended coil. (Figure venerated 
using MOLSCRIPT (Kraulis (199 1), infra). 
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Figure 3 shows the density for the disulfide bridges Cys 212-Cys 228 and Cys 2 1 3-Cys 2 1 8 is presented as 
revea ed in the S;gmaA weighted 2mFo . DFf electron density map (Rm(J ^ Q ^ 

ca.cu.a.ed from the mode, refined to 2.2 * the map has been contoured at ,„. (Figure drawn with the O comp 

program (Jones, Acta Crystaiiogr. 447: }) 0- ] 1 9 ( 1 99 1 ))). 

5 • h /'T/ " S ' ere0 dia8ram " PreSenttd ° f ^ SUperim ' ,osed C * «™« from the rwo crystallographicallv 

he f0minS ' ' in b,Ue ' — * 2 is in re erred , 

ne text are .abeled. Residues 259 and 260 have not been incorporated in the .ode. of monomer 2, since no elecl 

upper ngh, corner of the proteins. (Figure generated by MOLSCRIPT (Kntulis (1991) infra)) 
fold. • ? Ure5iSaSChema!krib ^ 

elZT 7 ^ m ° n0n,er 2 ' "* d0main ' 8reen ' WhHe the «P -n- -nsists of a b.ue 
hehcal subdomam and a i, s h. b.ue maturation subdomai, (Figure generated using MOLSCRIFT(K™i»(,99l) 

15 GRASP^ 8 rr? is v cpTOenu,ionof * emoieeuiarsurfaceof,he ^ 

co„«^ 

con-esponds to posmve potential > + 15.0 kT/e and dark red to a negative <-!5 0 kT/e nmmri.i c ^ 
view.a.ongthediad with the dimer oriented as in Figure,. Figure!* sde^Z ^ 
with respect to 6A. ^^ewofthed.mer.nmery degrees rotated 

Fi 8«"™-Fpre S emsatopologi^^ The arrancemem 

-^"^vary great,, ^^.^^^J^ 
and PDB entry codes are given in between brackets): Figure 7A shows the PPr a „ . 

domain helical; Figure 7C shows CPY (LYSC, Endrizzi « 0 / (19941 mm , , '. , ^'"f r ^ «P 

25 deha.ogenasepHAD.Ftanken^.. , L 0 0,29 302(^ 

_u yy *Wt cap domain he ica butqu tedifferem frnm 

5c/e*c e «j. 872-879 (1991)), cap doma.n large and predominantly «-helical The seconds « , 
agents were generated with the computer pro.ram O. using « ^ ^ / ^ ^ t 
30 Brookhaven Protein Data Ban, (This Figure was generated using MOLSCRlPT(Kraulis (199^) iiyTra)) 

Figure 8A-B shows the superposition of the C traces from the PPCA and Cpw m „ n „ m ' t • . 
™j"*tference S between then™^^^ 

and the 'he.ica. subdomain' is rotated with respect ,o ,h e CPW ou„Z n 7 * ^ 
^M)., igwSB ^^ 

>u. his 4.9 and Asp ,72 is shown as light blue spheres. Two cysteines Cvs '53 and r„« ^ r , 

45 rend . '° " ' C '° Se " UP repreSenta " 0n ° f ,he ' blockin S' C«idues 272-277) bound in the active 

45 -^-aMic.^ 



site. 

in orange, residues 
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from helica. domain in magenta and residues from the core domain in cyan. ^ excision ^ „ sh<jwn 
S,d char* are shown for readues making extensive contacts with the b.ocking peptide or if mentioned n the ,ex. Z 
catalyt.c tr.ad « sh wn in white. (Figure drawn with O (Jones (1991). infra)) 

5 form of P^T H " 8 KP ^ tnmim ° f e,emCTtS •"P— to * « the activation mechanism cf the precursor 

5 form of PPCA as discussed m the text. The C-oacc of the core domain is shown in cyan, the he.ical subdonl in Z 

J" SUMOm8ln " ~«* Md ««- P«** i» '"own in blue . Relevam side cnafns „ dep I * ^ 
labeled. Rearrangement of the residues 254-302 limited by the disulfide Cys 253 and r« im M , 

Figure 12 is a schematic representation of the proposed activation of PPCA. The active site cleft « f—i k 
the core domain (indicated as 'core' in the above scheme) and the he.ical subdomam (indicT L t ' t 7 " 
subdomain (indicated as 'm •) contains the residues that block the active site cleft rendeZT 
15 inactive, shown in structure ,. In the acidic endosome/lysosome. L^^Z^^T^ 

proves as a firs. step, followed by c.eavage of me po.ypep.ide chain removing the excisi™ peptide M eZ 7 
» pathway 2b. proteo.ytic c.eavage of the excision peptide might form the rigger for the J^IT^"' 
,eb,oc = 

rigure 13 shows the amino acid sequence of a human nPPr a tk. ■• 
.k- nuraan pi'w.A. The under ined port on (residues ~>m io«-> 

shows an exemon peptide for conversion to the mature form. PPCA. } 
Figure 14 shows me amino acid sequence of a human PPCA. 
^-"^owsasequenceaUgn^emb^^ 
rescues among all three sequences are boxed. Residue numbering i, included for the pP^A ^„7 

knowledge from the superposition of the CPW (Liao e, a/ 1992) and CPY fFndri„- , / US,ng 3D - sm,ctura » 

10 — . ' **"«iujw-4^)ana cap domain (residues 183-302} The 

30 secondary structure elements for the PPCA precursor are depicted with shaded bars (for deui s on the J „ , 
nomenclature, see Ruderdco eta!. Structure 3: 1249-1259 (1988) ). gnmem ^ 

Figure 16 show, a schematic represent*™ of a 'bootstrapping- cycle as described in Example 2 

nguren.sarepres^fcuWanin^^ 

wenasthenumberofcorrectsidechainst-c). Note wat after ^ round J ^ >" 

¥ igure 19 is a representation of a comparison of the C« trar.- fro™ . « "ncnu. 
comp.e.ePPCAmonomer^owninylw, ^ZZZZ^ZZ^^ 
PPCA monomer consists of a core domain and a cap domain. The heiica. subdomain an d ,h 
forming the cap domain have been shown in .he figure above. n,atUra, ' 0n SUbd ° main 
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F.gure 20A-D is a representation of Ihe res Iving power f the bootstrapping procedure showing three different 
stages ,n map quality. The atomic coordinates f the refined model are visualized with the electron density in Figures 
20B. 20C and 20D. Figures 20A and 20B show the initial 20,^1-0^1 SigmaA weighted map calculated using 
phases from the molecular replacement solution. The electron density is essentially unimerpretable. Fig. 20C shows 
5 twofold averaged2|F,J-|F„.| e lecrron density map calculated using inverted phases from cycle bmc6. The density for 
P-strand M P 2 (residues 266-271) has become clearly visible. Fig. 20D shows unaveraged 2m|F 0 J-D|F c . lt | SigmaA 
we.gh.ed map calculated using phases from the refined model. The quality of the density is very good. Density for the 
hehx Ma. (residues 287-293) which assumes a different conformation in the two monomers is now also apparent 

F,g U re21 shows a Ramachandran plot calculated for one monomer from a refined model of a pPPCA Both 
10 monomers in the asymmetric unit give essentially equivalent plots. 

Figure 22 shows a schematic of a computer system for PPCA or pPPCA structure determination and/or rational 

drug design. 

Figure 23.1-52 lists the atomic coordinates for the active site of a pPPCA dimer having the amino acid 
sequence presented as portions of at least one of 50-76. 144-155. 173-197,226-253,226-288.294-310 327-344 338 
350. 366-38. and 423-436 of (Figure 23.1-23.26) 452 amino acids (designated 1 -452) of monomer 1. as well as 
correspondmg portions of (Figure 23.26-23.52) 452 amino acids (designated 1001-1452) of monomer 2. 

Detailed Description of the Preferred Embodiments 
The present invention provides methods for expressing, purifying and crystallizing a protective 
protem/catheps.n A (PPCA) ora precursor protective protein/cathepsin A (pPPCA). where the crystal! diffract x-rays 
w,m sufficiently high resolution to allow determination of the three-dimensional structure of the PPCA or pPPCA or 
a port.on or subdomain thereof. The three-dimensional structure (*.g..as provided on computer readable media of 'the 
present m venoon) is useful for rational drug design of ligands of a PPCA or a pPPCA. Such ligands can be synthesized 
or recombmantly produced and are useful as diagnostic agents or drugs for diagnosing, treating, inhibiting or preventing 
at least one PPCA- or pPPCA-related pathology. g 

ir C, ' 0n f r ana,yttd ,0 Pr<>Vide " ,0miC m0dCl ° WPUt dB " C0 ™P°" d "* » *■ three-dimensiona. 
structure, e.g.. as prov.ded on computer readable media. The computer analysis of the atomic coordinate/x-ray 
d.ffracnon data and/or the amino acid sequence a..ows the calculation of the secondary, tertiary and/or quaternary 
s ructures; domains; and/or subdomains of the protein. These domains are combined and led by add Z 

f^ 

PPr A St T!7 ? etem,ina,i0n mtth0dS " 8,50 " rovided * P^«n, invention for rational drug design (ROD) of 

Z.c,ed r , P o nt OH ^ ^ ~ C ° mPU,er m0de,ing ^ *" ««™ 

ubd m 3 ^ 8C,iVe binding SiteS ' " 0ther « or 

n d £ " ' " ThCSe ' igandS Hn ^ ^ Pr ° dUCed ** M ( ° r in ««"•'*"■ or 

b.nd.ng to a PPCA or pPPCA. according to methods and compositions of the present invention 

The actual PPCA or pPPCA-.igand complexes can optionally be crystallized and analyzed usin* x-rav 
I T 777 ^ dif * aCli ° n " 00Uined ^ Simi,arly USed 10 M " ulate < h " dimensional inte^acti 

012; J PP , CA °; pppCA - to confi ™ ,hat ,he iigand bind$ ° r - «* £2 

oma,n(s or subdoma,„( S ) of the PPCA or pPPCA. Such screening methods are selected from assays for a, leas, on 
m'tf 7? . PPCA ° r 8 PPPCA - rKUl,inS ,igandS ' Pr ° Vided b > mMh0ds ° f the ^ i-n ion 

let;: r : PPW 

45 P CM; IPC? : "T ^ * hUmanS - ° f 3 PPCA - > PPCA «" n,-*. other 

*3 rl CAs or pPPCAs from other sources, such as other eukaryotes. 
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A PPCA or pPPCA is also provided as a crystalUzed protein suitable for x-ray diffraction analysis. The x-ray 
diffraction patterns obtained by the x-ray analysis are of moderate, to moderately high, to high resolution, eg. 30-10, 
10-3.5 or 1.5-3.5 A, respectively, with the higher resolutions included. These diffraction patterns are suitable and useful 
for three-dimensional structure determination of a PPCA or a pPPCA. domain or subdomain thereof. 
5 The determination of the three-dimensional structure of a PPCA or pPPCA has a broad-based utility. 

Significant sequence identity and conservation of important structural elements are expected to exist among different 
PPCAs or pPPCAs. Therefore* the three-dimensional structure from one or few PPCAs or pPPCAs can be used to 
identify ligands that have diagnostic or therapeutic value for at least one PPCA- or pPPCA-related pathology that may 
involve PPCAs or pPPCAs having different amino acid sequences. 

1 0 Defermination of Protein Structures 

Different techniques give different and complementary information about protein structure. The primary 
structure is obtained by biochemical methods, either by direct determination of the amino acid sequence from the 
protein, or from the nucleotide sequence of the corresponding gene or cDNA. The quaternary structure of large proteins 
or aggregates can also be determined by electron microscopy. To obtain the secondary and tertiary srruccure, which 

15 requires detailed information about the arrangement of atoms within a protein, x-ray crystallography is preferred. See, 
eg., Blundell, infra; Oxender, infra; McPherson, infra; Wyckoff, infra. 

The first prerequisite for solving the three-dimensional structure of a protein by x-ray crystallography is a well- 
ordered crystal that will diffract x-rays strongly. The crystallographic method directs a beam of x-rays onto a regular, 
repeating array of many identical molecules so that the x-rays are diffracted from it in a partem from which the structure 

20 of an individual molecule can be retrieved. Well-ordered crystals of globular protein molecules are large, spherical, or 
ellipsoidal objects with irregular surfaces, and crystals thereof contain large holes or channels that are formed between 
the individual molecules. These channels, which usually occupy more than half the volume of the crystal, are filled with 
disordered solvent molecules. The protein molecules are in contact with each other at only a few small regions. This 
is one reason why structures of proteins determined by x-ray crystallography are generally the same a*, those for the 

25 proteins in solution. 

The formation of crystals is dependent on a number of different parameters, including pH. temperature, protein 
concentration, the nature of the solvent and precipitant, as well as the presence of added ions or ligands to the protein. 
Many routine crystallization experiments may be needed to screen all these parameters for the few combinations that 
might give crystal suitable for x-ray diffraction analysis. Crystallization robots can automate and speed up the work of 
30 reproducibly setting up large numbers of crystallization experiments. 

A pure and homogeneous protein sample is important for successful crystallization. Proteins obtained from 
cloned genes in efficient expression vectors can be purified quickly to homogeneity in large quantities in a few 
purification steps. A protein to be crystallized is preferably at least 93-99% pure according to standard criteria of 
homogeneity. Crystals form when molecules are precipitated very slowly from supersaturated solutions. The most 
35 frequently used procedure for making protein crystals is the hanging-drop method, in which a drop of protein solution 
is brought very gradually to supersaturation by loss of water from the droplet to the larger reservoir that contains salt 
or polyethylene glycol solution. 

Different crystal forms can be more or less well-ordered and hence give diffraction patterns of different quality. 
As a general rule, the more closely the protein molecules pack, and consequently the less water the crystals contain, the 
40 better is the diffraction partem because the molecules are better ordered in the crystal. 

X-rays are electromagnetic radiation at short wavelengths, emitted when electrons jump from a higher to a 
lower energy stale. In conventional sources in the laboratory, x-rays are produced by high-voltage tubes in which a 
metal plate, the anode, is bombarded with accelerating electrons and thereby caused to emii x-rays of a specific 
wavelength, so-called monochromatic x-rays. The high voltage rapidly heats up the metal plate, which therefore has 
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■obecooled. Efficient cooling is achieved by so-called rotating anode x n, 

during the experiment so that different parts are heated up ^ m the me,al " late Solves 

5 ^Whenusedasanx-raysole^^ 

-age ring. Po.ychromat.c x . ny ^ J^^^"T * Channe ' ed ^ «* 

with wavelengths of 0.2 - 3.5 A. * ""^ W,nd0W ,hat «"«« *n>ugh x-ray radiation 

In diffraction experiments a narrow and parallel beam of x ra« • . l. 
onto the crysta, , 0 produce diffracted ^ ^ £ ■ «"» « from the x . ray 50UrC€ and directe<J 

-e.hectysta.frommanv^,^,^^ ^p^tybeammus, 
the beam during the experiment. P d " TraC "° n s P ots - «« » the crystal is rotated 

i5 "--^^ The 
15 detect directly in a digged form into a compu er eZL^ , ' ^ *"» ,he S *" als ** 

•he timere^redto collect and measure diffrac.L'^ 

— ™^ 

20 d,mens,on, array, as in . crysU , ^ ^ ^ 2Z ' " ^ * 3 ^ *- 

case, these x-rays, col.id.g from different di^^^^^^-«««^. 
w < add togethertoprod.ee diffracted beams ^2^^ ^ ^ 
de,ector - « can be recorded as .partem on a photographic plate or 

f COmain «- my from each of 2 °' 00<> """» ' haI h " 

•b- .ndivi dual aloms from suc „ a .Ui^ a, ° mS " * * «* information 

"-die such problems is cal.ed the Fourier transform ° mPmi,m - ^ -« *« is used ,o 

' S ,0SI * -periments. A« three propel t ne e ^ f f " " **« ™<* - *« 
*e Position of the atoms giving rise to the ^Z^*"**^^*^**^ 
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contribution to the diffraction pattern), such that there should not be too many of them (so that their positions can be 
located): and they should not change the structure of the molecule r of the crystal cell, i.e., the crystals should be 
isomorphous. Isomorphous replacement is usually done by diffusing different heavy-metal complexes into the channels 
of the preformed protein crystals. The protein molecules expose side chains (such as SH groups) into these solvent 
channels that are able to bind heavy metals. It is also possible to replace endogenous light metals in metalloproieins with 
heavier ones, e.g., zinc by mercury, or calcium by samarium. 

Since such heavy metals contain many more electrons than the light atoms (H, N, C, 0, and S) of the protein, 
they scatter x-rays more strongly. All diffracted beams would therefore increase in intensity after heavy-metal 
substitution if all interference were positive. In fact, however, some interference is negative; consequently, following 
heavy-metal substitution, some spots measurably increase in intensity, others decrease, and many show no detectable 
difference. 

Phase differences between diffracted spots can be determined from intensity changes following heavy-metal 
substitution. First, the intensity differences are used to deduce the positions of the heavy atoms in the crystal unit cell. 
Fourier summations of these intensity differences give maps of the vectors between the heavy atoms, the so-called 
Patterson maps. From these vector maps the atomic arrangement of the heavy atoms is deduced. From the positions 
of the heavy metals in the unit cell, one can calculate the amplitudes and phases of their contribution to the diffracted 
beams of protein crystals containing heavy metals. 

This knowledge is then used to find the phase of the contribution from the protein in the absence of the heavy, 
metal atoms. As both the phase and amplitude of the heavy metals and the amplitude of the protein alone is known, as 
well as the amplitude of the protein plus heavy metals (r.e., protein heavy-metal complex), one phase and three 
amplitudes are known. From this, the interference of the x-rays scattered by the heavy metals and protein can be 
calculated to see if it is constructive or destructive. The extent of positive or negative interference, with knowledge of 
the phase of the heavy metal, give an estimate of the phase of the protein. Because two different phase angles are 
determined and are equally good solutions, a second heavy-metal complex can be used which also gives two possible 
phase angles. Only one of these will have the same value as one of the two previous phase angles; it therefore represents 
the correct phase angle. In practice, more than two different heavy-metal complexes are usually made in order to give 
a reasonably good phase determination for all reflections. Each individual phase estimate contains experimental errors 
arising from errors in the measured amplitudes. Furthermore, for many reflections, the intensity differences arc too small 
to measure after one particular isomorphous replacement, and others can be tried. 

The amplitudes and the phases of the diffraction data from the protein crystals are used to calculate an electron- 
density map of the repeating unit of the crystal. This map then has to be interpreted as a polypeptide chain with a 
particular amino acid sequence. The interpretation of the electron-density map is made more complex by several 
limitations of the data. First of all, the map itself contains errors, mainly due to errors in the phase angles. In addition, 
the quality of the map depends on the resolution of the diffraction data, which in turn depends on how well-ordered the 
crystals are. This directly influences the image that can be produced. The resolution is measured in A units; the smaller 
this number is. the higher the resolution and therefore the greater the amount of detail that can be seen. 

Building the initial model is a trial-and-crror process. First, one has to decide how the polypeptide chain 
weaves its wnv through the electron-density map. The resulting chain trace constitutes a hypothesis, by which one tries 
to match the ' ,nsity of the side chains to the known sequence of the polypeptide. When a reasonable chain trace has 
finally been obtained, an initial model is built to give the best fit of the atoms to the electron density. Computer graphics 
are used both for chain tracing and for model building to present the data and manipulated the models. 

The initial model will contain some errors. Provided the protein crystals diffract to high enough resolution (e.g.. 
better than 3.5 A), most or substantially all of the errors can be removed by crystal lographic refinement of the mode l 
using computer algorithms. In this process, the model is changed to minimize the difference between the experimental! v 
observed diffraction amplitudes and those calculated for a hypothetical crystal containing the model (instead of the real 
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molecule). This difference is expressed as an R factor (residual disagreement) which is 0,0 for exact agreement and 
about 0,59 for total disagreement. 

In general, the R factor is preferably between 0.15 and 0.35 (such as less than about 0.24-0.28) for a well- 
determined protein structure. The residual difference is a consequence of errors and imperfections in the data. These 
derive from various sources, including slight variations in the conformation of the protein molecules, as welt as 
inaccurate corrections both for the presence of solvent and for differences in the orientation of the microcrystals from 
which the crystal is built. This means that the final model represents an average of molecules that are slightly different 
both in conformation and orientation. 

In refined structures at high resolution, there are usually no major errors in the orientation of individual 
residues, and the estimated errors in atomic positions are usually around 0.1-0.2 A, provided the amino acid sequence 
is known. Hydrogen bonds, both within the protein and to bound ligands, can be identified with a high degree of 
confidence. 

Most x-ray structures are determined to a resolution between 1 .7 A and 3.5 A. Electron-density maps with this 
resolution range are preferably interpreted by fining the known amino acid sequences into regions of electron density 
15 in which individual atoms are not resolved. 

An amino acid sequence is preferred for accurate x-ray structure determination. Thus, recombinant DNA 
techniques have had a double impact on x-ray structural work. When a protein is cloned and overexpressed for structural 
studies, the amino acid sequence, necessary for the x-ray work, is also quickly obtained via the nucleotide sequence. 
Recombinant DNA techniques give us not only abundant supplies of rare proteins, but also their amino acid sequence 
20 as a bonus. See. e.g., Blundell, infra; Oxender, infra; McPherson, infra; WyckofT, infra. 
Isolated PPCA and pPPCA Polypeptides 

A PPCA or pPPCA polypeptide can refer to any subset of a PPCA or pPPCA as a domain, subdomain, 
fragment, consensus sequence or repeating unit thereof. A PPCA or pPPCA polypeptide of the present invention can 
be prepared by, e.g..: 
25 (a) recombinant DNA methods; 

(b) proteolytic digestion of the intact molecule or a domain, subdomain or fragment thereof; 

(c) chemical peptide synthesis methods well-known in the art; and/or 

(d) by any other method capable of producing a PPCA or pPPCA polypeptide and having a conformation 
similar to a structural or functional subdomain of a PPCA or a pPPCA. 

A biological activity of PPCA or pPPCA can be screened according to known screening assays. The minimum 
peptide sequence to have activity is based on the smallest unit containing or comprising a particular domain, subdomain, 
fragment, region, consensus sequence, or repeating unit thereof, having at least one biological activity of a PPCA or 
pPPCA, such as protecting activity, inhibiting activity or enzyme activity. Non-limiting examples of such activities are: 
protecting activity for P-galactosidase or neuraminidase (NA); modulating activity (inhibition, stimulation or activation) 
35 as an for endothelin I (serine carboxypeptidase) or cathepsin A and peptide hydrolyzing activity (e.g.. substance P and 
substance P-free acid; oxytocin and oxytocin-free acid; neurokinin A; angiotensin I; and bradykinin. 

According to the present invention, a PPCA or pPPCA includes an association of two or more polypeptide 
subdomains. such as at least one 4 amino acid portion of a core or cap domain of a PPCA or pPPCA. This can include 
1-14 subdomains of the cap domain anoVor 1-44 subdomains of the core domain (as monomers ordimers). or any range, 
40 value or combination thereof. Preferably 1-4 sets of each of at least one core or cap domains or subdomains are 
included. 

The structure of a monomer or domain of at least one PPCA includes at least one subdomain of a PPCA of a 
pPPCA of the present invention can include one or more of the following subdomains, as described herein. Generally 
a PPCA or pPPCA consists of a dimer of a core domain and a cap domain having the following subdomains having rhe 
45 specified residues, e.g., as presented in Figure 13 (pPPCA) or Figure 14 (PPCA):: 
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Core domain subdomains: Cpi. 21-27; Cp2, 32-39; CP3. 50-54; Col. 63-67" CpM 73-75- CQ5 82 
84; CP6. 94-98; Co2, 1 18-135; CP7, 144-149; C«3. 152-163; CPS. 17.-177; Ca4. 307-313- 0*5*316 321- 

C«6.336-341;Ca7,350-359;CP9.363-369;C«8,377-386;Cp.0.39M01;CPl..407-416;Cpi2 419-424- 
Co9, 43 1-434: ColO. 436-447; and ' 

Cap domain subdomains: Hal. 183-196; Ha2, 202-212; Ho 3,226-240; Mpl. 261-264- MP^ 267 
270; Ma 1. 290-293; MP3, 296-299. Note ,ha, for monomer 2 the secondary structure assign.™* in Z cap 
domain are slightly different than in monomer 1 . 

A PPCA or pPPCA polypeptide of the invention can have at least 80% homology, such as 80-100% overall 
homology or identity, with one or more corresponding PPCA or pPPCA subdomains or fragments as described herein 
such as a 4-542 ammo acid fragment or portion of the amino acid sequence of Figures 13. 14 or 15. As would be' 
understood by one of ordinary ski., in the art. the above configurations of subdomains are provided as part of a PPCA 
or pPPCA po.ypep,ide of the invention, when expressed in a suitable host cell, or otherwise svnthesized, to provide at 
leas, one statural or functional feature of a native PPCA or pPPCA. such as a, least one PPCA-relaed biological 
activ.ty. Such activities can be assayed using a suitable assay, to establish at leas, one PPCA bio.ogica. activity of one 
or more PPCAs or pPPCAs of the invention. A PPCA or pPPCA polypeptide of the invention is not natura.ly occurrino 
or ,s naturally occurring but is in a purified or isolated form which does not occur in nature. Examples of suitable PPCA 
activity assay inc.ude. eg., cathepsin A activity (Ga.jar, « a,.. J. Bid Cke m . 266A 4754-14762 (1991); Endothe.in 1 

T " ° L 1 Bi ° L ^ 267aS12 - 2 ™^ - "'hykinin ^midase activity (Jackman 
el a/., J. Biol. Chem. 265:) 1265-1 1272 (1990)). <»«man. 

Percent homology or identity can be determined, for example, by comparing sequence information using the 
GAP computer program, version 6.0. available from the University of Wisconsin Genetics Computer Group (UWGCG) 
The GAP program utilizes the alignment method of Needleman and Wunsch (7. Mol. Biol. 48:443 (1970) as revised 
by Sm.th and Waterman fr,* A PP l Math . 2:482 (198.). Briefly, the GAP program defines similarity ns t'he number 
of aligned symbols (,e.. nucleotides or amino acids) which are similar, divided by the total number of symbo!s in the 
shorter of the two sequence, The preferred default parsers for the GAP program include: ( 1 ) a unitary comparison 
matrix (conta.ninga va.ue of 1 for identities and 0 for non-identities) and me weighted comparison matrix of Gr bskov 
and Burgess. Nucl. Acids Res 14:6745 (1986). as described by Schwartz and Dayhoff. eds ATLAS OF PRorZ 
fCMCEAVOSTRUCTU^^^ 

for each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no penalty for end gap* ' 
Thus, one of ordinary skill in the art. given the teachings and guidance presented in the present specification 
w,M know how to add. de!ete or substitute other amino acid residues in other positions of a PPCA or pPPCA to obtain 
substituted, deletional or additional variants thereof. 

Non-limiting examples of substitutions of a PPCA or pPPCA domains or polypeptide of the invention are those 
m which at leas, one amino acid residue in the protein molecule has been removed and a different residue added in its 
Place accord.ng ,o the fol.owing Table 2. The types of substitutions which can be made in the protein or peptide 
mo eculc of the mven.ion can be based on analysis of the frequencies of amino acid changes between a homo.ogous 

def T ^ 11,056 PfeSented in ' y ^ ° n SUCH " substitution^ 

aenned herein as exchanges within one of the following five groups: 

1 Small aliphatic, nonpolar or slightly pola, residues: Ala. Ser, Thr (Pro. Gly); 

2 Polar, negatively charged residues and their amides: Asp. Asn. GIu. Gin; 

3 Polar, positively charged residues: 
His. Arg, Lys; 

4. Large aliphatic; nonpolar residues: 

Mel. Leu. He. Val (Cys); and 
5 Large aromatic residues: Phe. Tyr. Trp 

Most deletions and additions, and substitutions according to the invention are those which do not produce 
rad-ca. changes ,n the characteristics of the protein or peptide molecule. "Characteristics- is defined in a non-inclusive 
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manner ,o define bo«h changes in secondary structure, e.g. a-helix or as well as changes in physio.ogica. 

cuv.ty. In bl0l0glcal activitv asMy , Howeyefi whefl ^ ^ effect of ^ O Jo ^ 

obe conned, one skiHed in the art wi I, appreciate that the 

be eva uated by a, ieas, one PPCA or pPPCA screening assay, such a, bu, no, limited to. immunoassays or b I ay 
j to conrimi at least one PPCA or pPPCA biological activity. 

Surprisingly, a PPCA and/or a pPPCA is now discovered to have serine carboxypep.idase activity and 
or e spondm strjctura , fMtureS) a| , hough havjng on|y abom ^ ^ P ty an 

10 Th '812 (1992). Endnzzi el a!., Biochemistry 33:] 1106-1 1120(1994); Oliis etat.. Protein Eng. J' 1 97-21 1 (1992)) 
PPCA possess , *„„, ^x^,^ „ bit| , „ t 

•°/»C4 andpPPCA Expression for Isolation and Purification 

A nucleic acid sequence encoding a PPCA or a pPPCA (Galjart e, aL Cell. 5<,-755.764 0988.) can h , 

«ermm, for hgat.on. restriction enzyme digestion to provide appropriate termini. fining in of cohel enl a 
approve Valine phosphatase treatment to avoid undesirable joining, and ligation U a P l^e 
Techmques for such manipulations are disclosed, e.g., in Sambrook « <?A, Clonic A Z7«o u 

Second edition. Cold Spring Harbor Laboratory. Cold Spring Harbor NY ,989V and Au7TT ^ ^ ' 
25 *^^Wi,ey.„te« 

A nucletc acid molecule, such as DNA, is said to be "capable of examine- * 

operably linked to nucleotide sequences which encode the polyoeDtide An on^KU v i, ■ , 

30 expression is a PPCA , pPPCA of fragment thereof, in recoverable amounts Tne.^jll, ^ 

The invention accordingly encompasses the expression of a PPCA or a nPPr a -.i. 

Preferred E t ry0,iC !" >S,S inC ' Ude ^ ^ "* — ™"» «* ei « h " " or in tissue culture 

K) c^7^e7 flS 7 a ' SOinC,Ude " b ^ 

CHO K , T" " C " in< ' Ude ^ HeU ee » $ - «" s ° f Ooblast origin such as VERO o 

CHO-K I . or cells of lymphoid origin and their derivatives. ° r 

Mammalian cells provide post-translational modifications to protein molecules including r 
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DUXB1 1 J i and their derivative, One preferred type of mamma.ian cells „ ce „ s which „ intended t0 

least one PPCA or pPPCA. A w.de vane* of transcnpnonal and translational regulatory sequences can be employed 
^e^ 

source,, such as. bu, no, hm.ted to. adenovirus, bovine papiHoma virus. Simian virus, or the like where the reau J ^ 

in;: associa,ed wiih a particu,ar gene which ha$ 4 high ~ * -»«— ■ A.,^cir:: 

"t7" Pr ° dUC,S - SUCh «• bu ' " 0t limited - ««». coll^n. myosin, protein production 

10 for large Z^TcT ^ ^ T ^ ^ " ^ *™ 

o arge scale PPCA or pPPCA producon accord.ng to the invention. Production of PPCA or pPPCA in insects can 
be ach.eved. for example, by infecting the insect host with a baculovirus engineered ,„ / 
po.ypeptidebymethods.ownto.oses.Hedinthere.atedart, U.ZZ^Zw""**" 
In a preferred embodiment, the introduced nucleotide sequence will be incorporated into a pUsmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of vectors can b elll f 

of .mportance „ selecng a particular plasmid or viral vector include: the ease with which recipient cells that conL" 
•he vector can be recognized and selected from those recipient cells which do no, contain the vector ^1 r 

Different host cells have characteristic and specific mechanisms for the translational and nn« , , , 
chosen to ensure the des-red modification and processing of the foreign protein expressed For example ZZZ 

PPCA or pPPCA. Furthermore, d.fferem vectorAost expression systems can effect processing reactions such a, 
proteolyt.c cleavages to different extents. g SUCh 35 

reoions rr SSed ' b 7^ 

2 ' TST, m inC ' Ude * re8i ° n maMM » ,h < -T «NA syn i! 

See. e.g. , Ausubel, infra; Sambrook. infra. syndesis. 
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Once the vector or nucleic acid molecule containing the consmict(s) has been prepared for expression the DNA 

-nsfecon conjugate protoplast fcsion, election, panicle gun technology, calcium phosphate-pr dl, ' 

which selects for the growth of vector-containing cells. Expression of the cloned gene mo.eculefs) results in T 
35 producon of a PPCA or pPPCA. This can take place in the transformed cells as such or foHow H 

thesece, r;~ 

A PPCA or pPPCA. or fragments thereof, of this invention can be obtained by expression fro™ ™k- 

ot^PPCa''' 1 ^ 'h ' U,0VVn nleI ' 10C ' s ' Alternatively, a PPCA or pPPCA can be purified from biolo^cal rnatsriaT A PPCA 

40 Li * T ""I Cd difTerem mamma "' an ,iSSUW {e 8 - hUma " - — >iv io kZ 

4U bovme testes, bovme liver, and'the like) of various genus and species. = * 

extraction^ " iS0 ' ated ^ * With ™hod «eps such as 

extract on. precp.tat.on. chromatography, affinity chromatography, electrophoresis, or the like For examole l 

45 ll bo V ; ' S ^ ^ C °' Umn Chromato ^ ror examp.e. on DEAE-ce.lu.ose. phosphoce u^e 
45 P^cyndyhcac.d-ag^ 
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orPTCAcanbeKo.a.edby.heuseo^^ ^ 

TvutrVl ° b,ained ^ k " 0Wn me ' h0d ( " e - **' HBrl0W Wd Lane GOODIES. A LABORATORY 
MANUAL Cold Sprmg Harbor Laboratory (,988); Colligan e, al, eds.. Current Protocols in , mmunolog> , Greene 
Pubhsh.ng Assoc. and Wiley .mergence, N.Y., ( ,992. ,993). the contents of which references are emirely incorporated 

-> herein by reference). H 

A PPCA or a pPPCA can be purified from different mammalian .issues (e.g.. human placenta, ra, liver mouse 
hver p,g k.dney bovine testes, bovine liver, and the like) of various genus and species, using known techniques such 
as gel filn^on, phase separation and affinity chromatography, fusing pdyclonal or monoc.ona. antibodies specific 
10 (^o) aCCOrdi " S 10 kn ° Wn meth ° dS - ^ °" ndere ' fl/ - »~* Bering. Liss, New York 

Overview of PPCA orpPPCA Purification and Crystallization Methods 

In general, a PPCA or pPPCA is isolated in soluble form in sufficient purity and concentration (e.g.. a monomer 
or d, me r) for crystallization. The PPCA or pPPCA is then iso.ated and assayed for biologica, activi J J,, calh ^ 
A) and for lack of aggregation (which interferes with crystallization). The purified PPCA or pPPCA preferably L 
« a single band for each monomer under reducing or nonreducing polyacrylamide gel electrophoresis (PAGE) 
(nonreducmg,s used to evaluate the presence of cysteine bridges). (PAGE) 
The purified PPCA or pPPCA is preferab.y crystallized under varying conditions of at leas, one of the 
ngandTand ^ U ^ CT ^P e ' buffer concentration, salt type, polymer type, polymer concentration, other precipitating 
C I, IT™ PPCA " PPPCA ' ^ **■ ^ * - - Protein 

C wf' , ^ L ° nd0n ° 976); ° Xendef - ^ "* "" d ^ of Protein 

Crystals, W„ey .nterscience, N.Y. (,982)) or methods provided in a commercial kit, such as CRYSTAL SCREEN 

(Hampton Research. Riverside, CA). The crystallized PPCA protein can optionally be tested for at least one PPCA 
^sp^ 

ZZTr^Tu See - Blunde "- 0xender ' /n>a: McPherjon - 

Protein Crystallization Methods K } 

The hanging drop method is preferably used ,o crystallize the purified protein. See, e.g., Blundell infra 
30 (1992), infra; CRYSTAL SCREEN, Hampton Research. 1290(1992), Tak.moto 

A mixture of the purified protein and precipitant can include the following' 

• pH (e.g., 7-9); 

• buffer type (e.g.. tromethamine (TR1ZMA), sodium azide (NaN l)( phosphate, sodium, or cacodylate 
acetates, im idazole, Tris HC1, sodium hepes); 

* • buffer concentration (e.^., 1-100 mM); 

• sal, type (.,.. sodium azide. calcium chloride, sodium citrate, magnesium chloride, ammonium 
acetate, ammonium sulfate, potassium phosphate, magnesium acetate, zinc acetate: calcium acetate) 
polymer type and concentration: (e.g.. polyethylene glycol (PEG) 1-50%, type 400-10 000)- 

) • other additives (sa,ts: potassium, sodium, tartrate, ammonium sulfate, sodium acetate, lithium sulfate 

sod.um formate, sod.um citrate, magnesium formate, sodium phosphate, potassium phosphate' 
organ.es: 2-propanol; non-volatile: 2-methy|.2.4-pentanediol); P-octyl slucoside and 

• concentration of purified PPCA orpPPCA (e.g.. I.0-100mg/ml) 
See. eg.. CRYSTAL SCREEN. Hampton Research. 

A non-limiting example of such crystallization conditions is the following: 

• purified PPCA or pPPCA protein (e.g.. 5 mg/ml); 
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• (2) solutions in serial mixtures 

0) 40-80 mM TRIZMA, 0.05-2.0 mM NaN, ■ 

005-2.0 Z NaNj 2 " 3 ° % P0 ' ye,hylene 8lyC01 (PEC) 8000 bU,rered 40 " 80 ^ ™ ZMA - 
5 * 0.05-0.5% p-octyl glucoside; 

• at an overall pH of about 8.0-8J. 

The above mixtures are used and screened by varying at least one »f „u u n 
precipitating sa,t type or additive or their concentration, lo typ "ZTl C0nCemra " On - 

Blunde.1. infra; Oxender. McPherson. infra; Wyckoff, *, ' A " 3,40 «*■ 

Protein Crystals 

15 Crystals appear after 1-14 days and continue to grow on subsequent days Som e „f,H 

op«.onally removed, washed, and assayed for bio.ogical activity (e g ncTLZ 'JZ 7 ^ 
further characterizations. Other washed crystals are P^ «t ^ ^ ^ 

same position as the purified PPCA or pPPCA are preferablv L, R 058 *" migr3te in the 

°--~™c^ 

20 x-ray analyses indicate that such crystals diffract a, modera, |y hig 1 1^ ' ^ ^ 

produced in a drop, they can be much larger size e g 9 m m c T ™ 

McPherson. ,„/,„; Wyckoff. infra; **' BIUnde "- °«" d «- 

tt»Of andpPPCA X-ray Crystallography Methods 

to-SO-OareoptionaHyusedforlongerx-rayexposures^g 5-72 J * 7l , * ^ ' f™" 

- *e crysta. is routed in the x-ray beam. ,,, for 5 . 72 hrs . I^^^S^^ 7 ^ " 
resolution of the x-ray diffraction patterns obtained Ctysta.s are or^T , , " " * ^ *« 

resolution of 10-1.5 A, with lower resolutions also useful such as 25 l5T.iT lea * » «lat>vely high 

~e of a PPCA or pPPCA in considerable detai,. as pr ^ZT * **~ ** 

Passmg an x-ray beam through a crystal produces a diffraction pattern as a result of ,h. , 
bong scattered by the contents of the crystal The diffraction n a «, m r ,aSareSUl,of,hex - ra . vs »" t ^'insand 

Panem are used , 0 determine parameters ^insiclecrys a^ 

■he packing of the molecules in the crysta. The in, «T „ 7 ' ^ " d l ° gain M °™™ on 

--'^hecrys^ 

^-ol.ectionof^^^ 

« , 0 yield a : of :::: ~ ^rr: ° f : to bea - 10 

molecule crystallized. The amplitudes of the diffracted x-rays are then col „ deKfmina "°" «f <»c 



WO 97/15588 



-15- 



PCT/US96/17325 



present in the crystal) is built. The phases can be determined with various known techniques, one being molecular 
replacement. 

For the molecular replacement technique one takes a known three dimensi nal structure thought to share 
structural homology with the structure to be determined, to generate after calculations a first set of initial phases. These 
phases are then combined with the diffraction information of the molecule for which you want to solve the structure of. 
The result is an electron density map of the molecules in the crystal from which the diffraction patterns originate. 

The phases can be further optimized using a technique called density modification, which allows electron 
densiry maps of better quality to be produced facilitating interpretation and model building therein. The atomic model 
is then refined by allowing the atoms in the model to move in order to match the diffraction data as well as possible 
while continuing to satisfy stereochemical constraints (sensible bond lengths, bond angles and the like). See, e.g., 
Blundell, infra\ Oxender, infra; McPherson, mfra\ Wyckoff, mfra\ 
Computer Related Embodiments 

An amino acid sequence of a PPCA or pPPCA and/or atomic coordinate/x-ray diffraction data, useful for 
computer structure determination of a PPCA, pPPCA or a portion thereof, can be "provided- in a variety of mediums 
to facilitate use thereof. As used herein, provided refers to a manufacture, which contains a PPCA or pPPCA amino acid 
sequence and/or atomic coordinate/x-ray diffraction data of the present invention, e.g.. the amino sequence provided 
in Figures 13-15, a representative fragment thereof, or an amino acid sequence having at least 80- 1 00% overall identity 
to a 5-542 amino acid fragment of an amino acid sequence of Figures 13-15. Such a method provides the amino acid 
sequence and/or atomic coordinate/x-ray diffraction data in a form which allows a skilled artisan to analyze and 
determine the three- dimensional structure of a PPCA, a pPPCA or a subdomain thereof. 

In one application of this embodiment, PPCA, pPPCA, or at least one subdomain thereof, amino acid sequence 
and/or atomic coordinate/x-ray diffraction data of the present invention is recorded on computer readable media. As 
used herein, "computer readable media" refers to any medium which can be read and accessed directly by a computer. 
Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as optical discs or CD-ROM; electrical storage media such as RAM and ROM; 
and hybrids of these categories such as magnetic/optical storage media. A skilled artisan can readily appreciate how any 
of the presently known computer readable media can be used to create a manufacture comprising computer readable 
medium having recorded thereon an amino acid sequence and/or atomic coordinate/x-ray diffraction data of the present 
invention. 

As used herein, "recorded" refers to a process for storing information on computer readable medium. A skilled 
artisan can readily adopt any of the presently known methods for recording information on computer readable medium 
to generate manufactures comprising an amino acid sequence and/or atomic coordinate/x-ray diffraction data information 
of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a computer readable medium 
having recorded thereon an amino acid sequence and/or atomic coordinate/x-ray diffraction data of the present invention. 
The choice of the data storage structure will generally be based on the means chosen to access the stored information. 
In addition, a variety of data processor programs and formats can be used to store the sequence and x-ray data 
information of the present invention on computer readable medium. The sequence information can be represented in 
a word processing text flic, formatted in commercially-available software such as WordPerfect and MICROSOFT Word, 
or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like! 
A skilled artisan can readily adapt any number of dataprocessor structuring formats (e.g. text file or database) in order 
to obtain computer readable medium having recorded thereon the information of the present invention. 

By providing on computer readable media having stored therein a PPCA or pPPCA sequence and/or atomic 
coordinates based on x-ray diffraction data, a skilled artisan can routinely access the sequence and atomic coordinate 
or x-ray diffraction data to model a PPCA. pPPCA. a subdomain thereof, or a ligand thereof. Computer algorithms are 
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public.y and commercially avai.able which a,.ow . skil.ed artisan «o access this data provided on a confer readable 
The present invention farther provides system, particularly computer-based systems, which contain the 

Z^£T* data deseribed herein ' Sueh sys,ems are desiened to do ~< d — - 

i abi 1 P S r " „ SUM ° main M NOn -' imitin8 Mampl « « -Computer workstations 

Tpels™ 

As used herein, "a computer-based system" refers to the hardware means, software means, and da.a s,ora 8 e 
10 means used to anaiyze the sequence and/or atomic coordi„a,e/,ray diffraction data of the present irvention T 

(CPU), ,npu means, output means, and data storage mean, A skilled artisan can readily appreciat which of th 



s^ed ,ht:r;tcT e B ^rT er ' based sys,ems " *• presem inven,ion comprise a da,a ^ — 

stored tnere.n a PPCA. pPPCA or fragment sequence and/or atomic coordinate/x-ray diffraction data of the Dr «™ 
■nvennonandthenecessaty hardware means and software means for ^ S ^ m J^^^ 
data ofth ere ' n S "* ra ^ e means " re fws to memory which can store sequence or atomic coordinate/x-ray diffraction 
data of the present ■nvent.on. or a memory access mean, which can access manufactures having recorded there 
20 sequence or x-ray data of the present invention. e 
As used herein, "search means" or "analysis means" refers to one ormore programs which are implemented 
on the computer-based system to compare a targe, sequence or target s^ra, mo.Lith the sequence oTxIy 
s.ored w„h,„ the data storage means. Search means are used to identify fragments or regions! PPCA or PPCA 

ys« ms of the present mvent.on. A skilfcd artisan can readi!y recognize that any one of the available algorithms or 
^^^^^^^^^^^^^ 

30 combing V herein ' ^ ,argW ™ ttXlnl m0,if '" ° f m ° ,if> " reftrS '° a " y rati0n8 »y «quence or 

30 ombmat.on o sequences in which the sequence*) are chosen based on a three-dimensional configuration o e l 

P^e.n targe, mot.fs tnc.ude. but are no, limited to. enzymic active sites, structural subdom.ins, epitop:s ^ 0 
domams and s.gna, sequence, A variety of structural formats for the input and output means can be used , " a n 
output the information in the computer-based systems of the present invention ' 

means - VT °"~V " * ~" '° ^ ' ° r ~* ~«™* «* - Horace 

m an s o .dennfy smtctura. mot,fs or interpret e.cctron density maps derived in part from the atomic ccrdina.e/" 

ri: : : anisan can readi,y recognize that any ° ne ° r the pub,ie * ^ 

programs can be used as the search means for the computer-based systems of the present invention 

One apphcauon of this embodiment is provided in Figure 22. Figure 22 provides a block diaeram of a 

z ,er r m 102 : hat can be used *° imp,emew ,he presem ^ Th « — *«» 

P~»106 connected toabus .04. Also connected to the bus ,04 areamain memory m^y^Z 
as random access memory. RAM) and a variety of secondary storage memory 1 10, such as a hard drive 1 12 a JZ 
s,ora S e medium 1,4. and a moni.or .20. The removable medium storage device .Umavrl 7 

45 d,s, a compact d,sk. a magnetic tape. e,c.) containing con.ro, logic and/or data recorded therein ma! * 
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the removable medium storage medium 1 14. The computer system 102 includes appropriate software for reading the 
control logic and/or the data from the removable medium storage device 114 nee inserted in the removable medium 
storage device 1 14. 

Amino acid, encoding nucleotide or other sequence and/or atomic coordinate/x-ray diffraction data of the 
5 present invention may be stored in a well known manner in the main memory 108, any of the secondary storage devices 
1 10, and/or a removable storage device 1 16. Software for accessing and processing the amino acid sequence and/or 
atomic coordinate/x-ray diffraction data (such as search tools, comparing tools, etc.) reside in main memory 108 during 
execution. The monitor 120 is optionally used to visualize the structure data. 
Structure Determination 

10 One or more computational steps, computer programs and/or computer algorithms are used to build a molecular 

3-D model of a PPCA or pPPCA, using amino acid sequence data from Figures 13-15 (or variants thereof) and/or atomic 

coordinate/x-ray diffraction data, as presented herein. 

In x-ray crystallography, x-ray diffraction data and phases arc combined to produce electron density maps in 

which the three-dimensional structure of a PPCA or pPPCA is then built or modeled. This structure can then be used 
15 for RDD of modulators of at least one PPCA- or pPPCA-related activity that is relevant to at least one PPCA- or 

pPPCA-related pathology. 

Density Modification and Map Interpretation. Electron density maps can be calculated using such programs 
as those from the CCP4 computing package (SERC (UK) Collaborative Computing Project 4, Daresbury Laboratory, 
UK, 1979). Cycles of two-fold averaging can further be used, such as with the program RAVE (Kleywegt & Jones, 

20 Bailey et al.. eds. % First Map to Final Model. SERC Daresbury Laboratory, UK, pp 59-66 (1994)) and gradual model 
expansion. For map visualization and model building a program such as M 0" (Jones (1991), infra) can be used. 

Refinement and Model Validation. Rigid body and positional refinement can be carried out using a program 
such as X-PLOR (Briinger (1992), infra), e.g.. with the stereochemical parameters of Engh and Huber (Acta Cryst. 
,447:392-400 (1991)). If the model at this stage in the averaged maps still misses residues (e.g.. at least 5-10 per 

25 subunit), the some or all of the missing residues can be incorporated in the model during additional cycles of positional 
refinement and model building- The refinement procedure can start using data from lower resolution (e.g., 25- 10A to 
10-3.0 A and then gradually extended to include data from 12-6A to 3.0-1.5 A. B-values (also termed temperature 
factors) for individual atoms can be refined once data of 2.8A or higher (e.g., up to 1 .5 A) has been added. Subsequently 
waters can be gradually added. A program such as ARP (Lamzin and Wilson, Acta Cryst. D49:\29-\47 (1993)) can be 

30 used to add crystal lographic waters and as a tool to check for bad areas in the model. Programs such as PROCHECK 
(Lackowski etaL, J. Appt. Cryst 26:283-291 (1993)), WHATIF(Vriend, J. Mot Graph 5:52-56 (1990)) and PROFILE 
3D (LUthy et al. Nature 3J<5;83-85 (1992)), as well as the geometrical analysis generated by X-PLOR can be been used 
to check the structure for errors. A program such as DSSP can be used to assign the secondary structure elements 
(Kabsch and Sander (1983), infra). 

35 The structure of a PPCA or pPPCA can thus be solved with the molecular replacement procedure such as by 

using X-PLOR (Briinger (1992), infra). A partial search model for the monomer can be constructed using a related 
protein, such as wheat serine carboxypeptidase structure (Liao et al. (1992), infra). The rotation and translation function 
can be solved to yield orientations and positions for the subunits in the crystal lographic asymmetric unit. This allows 
phases to be determined that, when combined with information from the x-ray diffraction patterns, allows electron 

40 density maps of a PPCA or pPPCA to be calculated. The atomic model is then built using these electron densiry maps. 
Cyclical two-fold density averaging can also be done to improve the electron density maps using a suitable program 
(e.g.. RAVE) and model expansion can also be used to add missing residues for each monomer, resulting in a model with 
95-99.9% of the total number residues. The model can be refined in a program such as X-PLOR (Briinger (1992), 
supra), to a suitable crystal lographic R^ The model data is then saved on computer readable media for use in further 

45 analysis, such as rational drug design. 
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national Daig n of Drug, ,/,„, Jnleract with , he ppCA orpFPCA 

The determination of the three-dimensional structure of a PPCA or dPPCA „a i.^ • 
basis for the design of new and specif, ligands for the diagnosis and/or tre^ent In D ' ' 

related pathology. lament of at least one PPCA- or pPPCA- 

5 /7 ra ' approachescM ***enfor^ . , 

of hgands of this protein. A computer-assisted, manual elation of the active s te stZr^ T 
use of software such as GRID ( Goodford, J. Med . a*, ^849-857 (^?JTT T " *** ^ 
faction sites between probes with varjous c J ^ 

«.y* the active site to determine sutures of inhibilg compou ^ „ I T, " ^ " 
10 '» h '^o Up sonmo,e^^ 

around accessible positions a, suitable energy contour leve.s. Suitab.e Jnds J h • ° *"* 
compounds or compositions are then t«t e rf tor™*,. ■ ■■ lnl »°'»ng or st.mulat.ng modulating 

p "ons. are then tested for modulating activities of at least one PPCA or dPPPa 

A d «-"he ra peu,^ 
l.m.ted to, a, leas, one selected from a nuc.eic acid, a compound, a p ro ,e „ an e. me T^' " ~ * " "* 
15 an isotope, a carbohydrate, an imaging agent, a apoprotein a g ^2Z tZ T T^' ' ""^ 
or fragment thereof, or any combination thereof, wh ch can be det^ Mailed T' I "* 

* are not limited to. Wic .abels. nJ^^^'J^^ ^ "* 
compoundsor metal, chemiluminescen, compounds andbio.ulescem^ 
2q diagnostic or therapeutic agent can be used in a method of the in^ ' ^ 

After preliminary experiments are done io determine the K of rh* e„k . . * i_ 

Henderson (Biochem. J. now r„ , \ determined, (e.^.. by the method of 

a^-incubatedinbuff, 

timecou^eandeach <m^i*^to*^JZ£^ ™ed over a suite b, e 

25 aqueous ethanol). The concentration of product is detenl Z « * °" ^ ^ hydroxide in 

-Wcence a gainst time can be cto. J^tS^'^"?*' ^ ' " *" ° f 

Tor standard regression analysis (Leatherbarrow, Trenas Biochem . ^"I^Z^TJ " maPPr ° Pria " 

0' J *{E i -Q - / )r - £ = o 
where (using the notation of Henderson (Biochem. J /J7:32l-333 (1972)): 

This equation is solved for the positive root with the constraint that 

of Jess than 1 0 V ^ * ' where there 15 a cha *ge m less function 
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Once modulating ligands are found and isolated r synthesized, crystal! graphic studies of the compounds 
complexed to a PPCA or pPPCA can be performed. As a non-limiting example, PPCA or pPPCA crystals are soaked 
for 2 days in 0.01- 100 mM ligand and x-ray diffraction data are collected on an area detector and/or an image plate 
detector (e g., a Mar image plate detector) using a rotating anode x-ray source. Data are collected to as high a resolution 
as possible, e.g.. an inner limit of diffraction of 1.5-3.5A. An atomic model of the inhibitor is built into the difference 
Fourier map (F Mtmamtia -F m „). The model can be refined to adjust the atomic positions to improve the fit with the 
electron density maps, while maintaining correct stereochemical constraints. The model will preferably have low r.m.s. 
deviations from the ideal bond lengths, as well as for the angles, respectively, as well as a low R-factor (preferably less 
than about 25-35%, such as less than about 35, 34, 33, 32. 3 1, 30. 29. 28. 27. 26, or 25%. 

Direct measurements of enzyme inhibition provide further confirmation that the modeled ligands are 
modulators of at least one biological activity of a PPCA or a pPPCA . As a non-limiting example, a modification (Chong 
elal.Biochim. Biophys. Acta 1077:65-7 1 (1991)) of the fluorometric assay of Potier (et aLAnalyt. Biochem. 9*287- 
296 (1979)) is optionally used to measure neuraminidase inhibition or stimulation, optionally including determination 
of inhibition constants (*,). Other suitable PPCA activity assay include, e.g.. cathepsin A activity (Galjart e, al.. J. Biol. 
Chem. 266: 14754- 14762 (1991); Endothelin I deamidase activity (Jackman, et al., J. Biol. Chem. 257:2872-2875(1992); 
and uchykinindeamidase activity (Jackman. el al.. J. Biol. Chem. 26S:\ 1265-11272 (1990)). 

Ligands of a PPCA or pPPCA. based on the crystal structure of this enzyme, are thus also provided by the 
present invention. A PPCA or pPPCA ligand is any molecule, compound or composition that is capable of associating 
with a PPCA or pPPCA and optionally modulating at least one function or structural feature of a PPCA or pPPCA 
20 Preferably, a PPCA or pPPCA ligand modulates at least one biological activity of a PPCA or pPPCA. Demonstration 
of clinically useful levels, e.g., ifivw activity is also important. In evaluating PPCA orpPPCA inhibitors for biological 
. activity in animal models (e.g., rat. mouse, rabbit) using various oral and parenteral routes of administration are 
evaluated. Using this approach, it is expected that modulation of a PPCA or pPPCA occurs in suitable animal models, 
using the ligands discovered by structure determination and x-ray crystallography. 
5 Evaluation ofTlterapeutlc Potentials of Compositions via a PPCA Animal Model 

The present invention also provides methods for identifying diagnostic or therapeutic ligands of PPCA or 
pPPCA via computer RDD, to treat a PPCA-related pathology. Generally, a method for determining the therapeutic or 
Agnostic use of a PPCA or pPPCA modulating ligand, to treat a PPCA related pathology, comprises the steps of 
admm.ster.ng a known dose of at least one ligand containing compositions to an animal model having a phenotype 
corresponding to a PPCA-related pathology, monitoring the appropriate biological or biochemical parameters and 
comparing the results with treated animals to those of untreated animals. Results indicating the onset or presence of a 
PPCA related pathology are generally referred to herein as "symptoms- of the disease. See.. e.g, U S Appl No 
08/397.693, filed March 2. 1 995, which is entirely incorporated herein by reference. 

Appropriate biological and biochemical parameters that reflect the onset and progression of a PPCA related 
pathology include, but are not limited to, (1) gross biological parameters, e.g.. physical appearance (i.e.. flattening of 
the face, rough haircoat and/or subcutaneous swelling in affected animals) or growth (reduced weight gain)- (2) gross 
behav,o ra i parameters, e.g.. lack of coordination; (3) biochemical assays, e.g.. assays of caihepsin A. N-acetyl-a- 
neuram.n.dasr or P-galactosidase activities in primary cultures of skin fibroblasts or tissue homogenates (4) 
histopatholo,..:,! studies (visceromegaly, i.e.. enlarged liver and spleen: accumulation of secondary vacuoles in kidnev 
i tissues: etc.). 

A first method of evaluating the therapeutic potential of a composition using the transgenic non-human animals 
of the invention comprises the steps of: 

(1) Administering a known dose of the composition to a first non-human animal havina a 
phenotype corresponding to a human PPCA related pathology; 

(2) Detecting the time of onset of symptoms in the first non-human animal: and 
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of symbol • C ° mP 7 8 T time ° f 0nSW ° f W ' H " ,n f ' m n ° n - hUman animal * *- of onset 
of ymptoms ,„ a second non-human animal having a phenotype corresponding to . human PPCA related 
pathology, which has not been exposed to the composition- " 
wherein a sutisticaUy significant deiay in the time of nset of symptom, in the firs, non-human anima. relative to the 

.nven,ionr; p ll~^ 

10 ohen , 0) Adn "' niSKrine 3 kn ° Wn dMe of ,he «*»P0*ta» » a firs, non-human anima. having a 

10 phenotype corresponding to a human PPCA related pathology a, an initial time, W ' 

(2) Determining the extent of symptoms in the first non-human anima. at a later time, t,- and 

(3) Comparing, a, t„ the extent of symptoms in the firs, non-human animal to the extent of 
Wm. m a second no,human animal having a phenotype corresponding to a human PPCA related 
pathology, which has not been exposed to the composition a, ^ 

•he polypeptide is present on the surface of the bacterium or virus prior to Z^Z 7 ' ^ 

c^atory injection of an iso.ated or recombinant ^"Z^SZZ^^*"" * 
°"'-Und the Peptide ^^^^ 

human ^"T !"* *** * * h " ««* « can then be tested in 

human c.m.cal trials, according to known method steps. 

30 lntheabov e methods, delivery of the composition beine tested to non-h..™.- • . • 

■"Ofeofthecomposmons or ofa liposome (Rahman and Schein in iwr • - ° n0,Oneor 

3 5 ro j,. coZzteT: 1 ::? 'j^^^szr ' r 5 (sep - i7> i985)) 

^ 0 „, ^ mcfl/&TO I8tn E ,. Gennaro ed _ ? > - * • genera.ly, 

Using compounds or compositions comprising at least one PPCA or PPr a m ^ i • ' , 
invention further provides a method for moduJng the activity J!^ "££^ " ? T T ^ 



WO 97/15588 



-21- 



PCT/US96/17325 



10 



15 



20 



protein in vivo. The contacting of such a cell with such a ligand results in the in vivo modulation of at least one 
biological activity of a PPCA or pPPCA. 

At least one PPCA or pPPCA modulating compound or composition f the invention can be administered by 
any means that achieve the intended purpose, using a suitable pharmaceutical composition or formulation. For example, 
administration can be by various parenteral routes such as subcutaneous, intravenous, intradermal, intramuscular! 
intraperitoneal, intranasal, intracranial, transdermal, or buccal routes. Alternatively, or concurrently, administration can 
be by the oral route. Parenteral administration can be by bolus injection or by gradual perfusion over time. 

A typical regimen for treatment or prophylaxis comprises administration of an effective amount over a period 
of one or several days, up to and including berween one week and about six months. It is understood that the dosage 
of a diagnostic/pharmaceutical compound or composition of the invention administered in vivo or in vitro will be 
dependent upon the age, sex. health,, and weight of the recipient, kind of concurrent treatment, if any, frequency of 
treatment, and the nature of the diagnostic/ pharmaceutical effect desired. The ranges of effective doses provided herein 
are not intended to be limiting and represent preferred dose ranges. However, the most preferred dosage will be tailored 
to the individual subject, as is understood and determinable by one skilled in the relevant arts. See, e.g.. Berkow et a l. 
eds.. The Merck Manual, 16th edition, Merck and Co., Rahway, N.J., 1992; Goodman et al., eds.. Goodman and 
Oilman's The Pharmacological Basis of Therapeutics, 8th edition, Pergamon Press, Inc., Elmsford, N.Y., (1990); Avery's 
Drug Treatment: Principles and Practice of Clinical Pharmacology and Therapeutics. 3rd edition. ADIS Press, LTD 
Williams and Wilkins. Baltimore. MD. (1987). EbadL Pharmacology. Little, Brown and Co.. Boston. (1985); Osol etai. 
eds.. Remington's Pharmaceutical Sciences, 18th edition. Mack Publishing Co.. Easton, PA (1 990); Katzung. Basic and 
Clinical Pharmacology. Appleton and Lange. Norwalk. CT (1992). which references are entirely incorporated herein 
by reference. 

The total dose required for each treatment can be administered by multiple doses or in a single dose. The 
diagnostic/pharmaceutical compound or composition can be administered alone or in conjunction with other diagnostics 
and/or pharmaceuticals directed to the pathology, or directed to other symptoms of the pathology. Effective amounts 
of a diagnostic/pharmaceutical compound or composition of the invention are from about 0.1 ug to about 100 mg/kg 
body weight, administered at intervals of 4-72 hours, for a period of 2 hours to 1 year, and/or any range or vafue therein. 

The recipients of administration of compounds and/or compositions of the invention can be any mammals 
Among mammals, the preferred recipients are mammals of the Orders Primata (including humans, apes and monkeys) 
Artenodactyla (including horses, goats, cows, sheep, pigs). Rodenta (including mice, rats, rabbits, and hamsters), and 
Cam 'vora (including cats, and dogs). The most preferred recipients are humans. 

Having now generally described the invention, the same will be more readily understood through reference 
to the following example which is provided by way of illustration, and is not intended to be limiting of the present 
invention. 

Example 1: Preparation, Purification and Crystallization of PPCA or pPPCA from Human 
> Cells 

The present invention provides, in one aspect, the determination of the three-dimensional structure of the human 
protective protein/cathepsin A (PPCA) in the precursor form (pPPCA) by a combination of molecular replacement and 
twofold density averaging. The structure presented here is the first of an enzyme associated with a human PPCA related 
pathology, and the third human lysosomal enzyme structure determined. The structure gives us insight into the zymogen 
activation mechanism of pPPCA . as well as the expected 3-D structure of PPCA and its specific and new enzymatic 
activities. 

PPCA anilpPPCA Expression and Purification 

Plasmld Conducts. AcMNPV transfer-plasm ids P JR2 and P BC3 (Figure 1) were derivatives of plasmid 
P Ac373. carrying the entire polyhedrin gene (Smith et al., 1985). In pJR2 a polylinker with a number of multiple 
clonmg sues (MCS) was inserted directly 3' of the polyhedrin promoter, and substituted a 33-nucleotide deletion of the 
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polyhedrin gene, starting with the ATG. pBC3 had the polylinker situated i„ , • ■, 

*e 33- n , deletion , his plasmid featured m ATQ ^SJJ^CC 27 " ^ " ^ °' 

Cher i„ P m or pBC3 as EcoR, fragments, using stand J pro elrT S 1 T " ' l99,) ' 
5 d e,«io„ mutant wa 5 tagged with , he human ^cT^ZlT" 7 ' ^ ° ^ 

were cu.tured in monolayers „ ^ , ^ ^^J^" 0PLB-SF21) 
(compiete medium). Wi.d-type ( w„ AcMNP v virus Li„ * 1 7 Ts ' ,0H 

-ACPPC A2 0 we, generated hyeotransfectingS.icei.sw 

u S .ng the calcium phosphate method, modified for insect cells rr»h , ° M8 P ' aSmid DNA ' 

^"^verifie^ 
15 were produced by infection of insect cells at 25-50 ft conflue j % ' ° f ' n0Culum 

(MOI) of < . pfu/cell. After 3 .0 6 days at 27«C when aZi ° mbman,V,rUSat 8 """"P 1 -^ of infection 

preparat.on were resolved on 12.5% SDS-polyacrvtamide ..L ., *~ 1 ( 1 Alu,U0B of P uri ^ 

from gels to IMMOBILON PVDV membranes (Millioore i ■ * P™" 15 WW! nns{ ™* 

(am,no acid 285-298. Ca.jan . «. I988 , was sl^ed 0 ! H """"" »** PPCA Subu ™ 
cova,„tly .i^edtotne carrier P^^xIT^^ T ^ Bfa *-™* " 

CON^o^,,^ Po.ycJ.antibX^^ 
35 subdermal injections of the protein (40-125 ug) mixed with i„c„ T"T *" ™ ed ta rabbit > b > """'h* 

* after the Hrs, injection. The ^S^T^^i^ 
■mmunoprecipitations of baculovirus produced PP CA * °" immUn0b, * ts «* "y 

4n ^-::;ir> d a:^^^ 

40 ^ingbuf^^ 

P osp a tase conjugate an| , rabbj[ , g0 w blocking b f^r ^r mCUbated 2 h Wi ' h alkali " 

Phosphatase "b^e (Sigm, 4-ami„odip„^^ 

C™a,U wionof „ CA . Practionscontaining meprecuZfo™ * 

gel were poo.ed. Subsequently the protein was cbJLJZT, ! Pr ° ,e ' n " ^ 0 " an SDS " p AGE 

45 ^ r 50mMMESp„, 5usi J C ^^ 

Crystals were grown usmg the hanging drop vapor diffusion 
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5 5 mg/m,. Crystal gr o W « h was enhanced „ mau °™™" ^ prote '" before mixing was 

-^-b, and in some cases by ^ wd LJ^^^ - Promotes growth of big 

2.- 5/r«««r e Determination o/apPPCA Crvuallir.H , „ 
C«, * fl/a R J JJ n CA C ^""^ front Hu man Cells 

-ed as an artifici ,, mother , jquor ^ £ pH 8* „M NaN) , 0 , 3% ^ ^.^ 

OUta* (.993). ,„ >fl) was used for ^ 2^7™ * ,9?9) ' ^ PTOg - KEFJX 
20 Com putmg p rojec( , Daresbury Laboratory UK ,979) I ™ (UK) Coilabora.ive 

CACROVATA, ,hen convened to ,m P H„Jan d ^ w h £7" ™*« 
cCleaedaregiveninr^ ,. 7* e V „ (Matthews, B.W. 7^ J^™™** ^ ° f *< - 

Molecular Replacement 
^^encea, ignmen , w ^^^^ 

l nd ' ltt,e ^ .he ,ap- domain (see Fig re ^ T^' " 8 ' ° f th "° re 

a Symmem , un , .naddi,o, th e„ ative J f ^^ ^ »»""■> bating nvo monomers in * 

. -««, together. ^ cross-rotation f U „c,ion waTcal ^7°™ " ** »«« «W» no, 

7™~<*fo,.ow^ 

the ro tat ,o n flmction accord (o Rm TJ« ro,a, ona, spaC e restricted ,0 the asymmetric u„i, of 

in angular grid intervals of 2.5° The 500(1 hi-k 

j ne jULKJ highest rotation 
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func.ion grid points were selected resulting from the product function of the two Patterson vector sets. The grid points 
(d.ffer.ng less than 8' around any given axis) were then clustered. The result was a lis, of 169 possible solutions for 
thero.anonrunaion.eachcorrespondingtoasetofthreeanglesdescrib^ The two top solutions were 

3.9 and 3.8 s.gma above the mean. PC-refinemen, (BrUnger. 1990) was carried out to optimize each of the .69 possible 
solut^susmgmecompletesearchmodelasasinglerigidbody. This yielded two orientations with a PC-index of 0 043 
and 0.051 respectively. The orientations of these solutions were (D, = 261.4, 0,-36^2 D, - 147 Jgv and (D - 
HJ2£ - 47.40. £>, - 23.22), respectively. In contrast, the rest of the possible solutions yie.ded an average POindex 

for each l^T ™ °* " ' A ** A transl " ional **** ™ found 

for each or, ntat.on a, posmons (x=33.30, y-5 ..97, and z=,2.79) and (x-25.23, y=28.58. and z=22.02'. with respect 

55.6% and 54.8% B me resolution ^ , 0 t0 ^ ^ , ^ ^ 

•n an R of 52.8 /. for data between 8.0 and 4.0A. bringing the R fM . down to 51.3% and increasing the CC to 0 22 
The molecular packmg was assessed on a graphics workstation, wh.eh revealed no Cashes between the placed search 

conu n d JZl ^ ,ar§e am ° Um ° f SPa " WM PrMem - PaCki " 8 *"*" «« ic unit 

onta.ned two half d.mer, each forming a dimer with another monomer in a neighboring unitceU. Thl- two cores in 

thc^ymmetnc un„ were related by x=73' around an axis tilted ,5.5' off the crystal.ographic . axis lying in the a, 
20 Iterative Model Building and Two-fold Averaging 

F^l s and phases from the molecular replacement solution. The map was contoured at 1 o and showed good density 
that the molecular replacement solution was correct. naicaiing 

of the ppST *^ ^ r0tate<1 ""' ffanS,i " ed PTObeS **" thC «"*« •»* «" building 
of th PPCA precursor. Th. non^sU.lographic symmetry (NCS) matrix was determined between the two cores usin 

uoeil k °?™ m *<™^^°V™°<<>'-. "91). Subsequentlya-bestmonomet'wasbuiltby 
supenmposmg the eleoron densities from each monomer core, and adjustmg the mode, accordingly. Residues were only 

mode, for wh.ch no dens.ty was visible were removed. An alanine was bui.t in the mode, at places where electro 
densuy for a side chain was partia.. .n this manner 294 residues, i.e. 65% of the C- atom, were buil, in the 'best 
-nome, core. The second monomer wa, generated from the 'best monome, mode, using the NCS operator relating 
two monomers m the asymm«ric unit. A, this point the data se, was partitioned in a working set and a test se 
cons.snng of 5% of the reflections between 8 - 2.2A to monitor the R„. (BrUnger e, al. 1 992b). The working data set 

IT :f ** P ° Si,i0nal ref,nemem - ^ ^ a " d ™> - -Panitioned d^ " 

Twentv.fi cycIes of refinement ^ the ^ ^ ^ 

P3n.a.mode,wereusedtocalc U ,,,eanew2m,F el .|-D,F ok | SigmaA weighted map which we called the 'best mon^ 

Averaging: SearCfor Missing Dens*, The phasing power from the rigid body refined 'test monomer 
«« . cons.st.ng of 294 residues per core was insufficient to bring back interpretab.e electron density for .he mi sTn 
part of the mode.. ,58 residues per monomer. To overcome this a 'bootstrapping' procedure was Spiled J 
^^^RAVE,^^ ,994a)andmode.expansion. The ,cs, monomer mT, a d h ^ 
body refined 'best monomer cores' served as the starting point for this procedure. 
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Six bootstrapping cycles were carried out, called bmc I thr ugh bmc6, allowing for the model to be extended 
in stepwise increments. Figure 1 6 sh ws a scheme of the steps incorporated in one bootstrapping cycle. After a cycle 
in which the model had undergone major expansion, a new molecular mask was calculated with MAMA (Kleywegt & 
Jones, 1994b) for use in the subsequent bootstrapping cycle. No phase recombination was applied between 
bootstrapping cycles. At the end of each cycle the inverted phases a m and inverted amplitudes F„. "s were discarded 
The NCS operator was re-optimized after cycle bmc3. The resolution range of the data included in the bootstrapping 
cycle started with 15 - 3.0 A for bmcl and was gradually extended to IS - 2.7 A in bmc6. The bootstrapping procedure 
is summarized in Table 2. To optimize the bootstrapping procedure, consideration was given to the molecular mask used 
in the averaging, the model building strategy and the refinement procedure. 

Molecular masks: Four different masks were constructed in total. The atomic radius of all atoms was set to 
4A to calculate each mask. The masks were then manually modified using mask editing options in 0 (Jones cat. 1991) 
Mask I. was constructed around the 'best monomer core 1 . Subsequently it was greatly enlarged by multiple blocks of 
10- IS A J in the regions where the model was incomplete (Figure 1 7). This was crucial to prevent the density in the 
insemon area's from being flattened during the averaging step. Approximately one half of the dimer interface was 
estimated to be formed by regions from the missing cap domain. Major expansions of the mask in this area were made 
to accommodate for this. This resulted in a serious overlap problem when the mask was duplicated to cover a complete 
d.mer. The mask was reduced where overlap occurred with the "overlapjrim" option of MAMA. After several 
bootstrappmg cycles, new incorporated polypeptide fragments were carefully assigned to one of the two monomers 
forming the dimer and the mask at the dimer interface area's was manually adjusted accordingly. Essentially the masks 
were kept far too large in regions where the model was missing in order to avoid erroneous flattening of electron density 
In contrast the masks were tightened around the area's of the molecule where the model was complete. 

Model Bunding: A conservative model building strategy was adopted. Initially only side chains were mutated 
m the core region to fit the PPCA amino acid sequence and where the density was clear, poly-alanine fragments were 
bu.lt m the insertion area's (loops and the cap domain). Newly included atoms were given a B-factor of 20 A : Only 
once models bmcS and bmc6 were obtained, was the electron density of sufficient quality to allow side chains to be 
incorporated confidently in the cap domain (residues 190 - 303). At this stage the C trace was virtually complete for 
the whole dimer and the sequence could be fit unambiguously. 

Refinemeni: Positional refinement was postponed until after 3 cycles of bootstrapping resulting in a final 
model containing 91% of the C- atoms. For* steps of positional refinement were then carried out to improve the 
geometry of the model. Subsequently only one of the refined monomer was taken and the other generated using NCS 
operators. The rational for delaying the positional refinement is addressed in the discussion. 

Completing the model: deviations from Infold symmetry. It was possible to add 148 residues and 185 side 
ehams per monomer after a total of 6 bootarapping cycles. At this stage, each subunit contained 442 residues and 4 1 3 
side chains, i.e. 98% of the C and 91% of the side chains atoms. The gradual mode, expansion as a function of the 
bootstrapping cycle is shown in Figure 1 8. 

Twenty residues were still missing in the asymmetric unit at this stage. These were localized to two stretches 
permonomer(260 - 262a nd 287-292). With most of the scattering mass incorporated, the monomers from mode. bmc6 
was refined individual with X-PLOR (BrOnger. ,992a) in an attempt to retrieve electron density for the sti.l missin. 
residues. After 40 steps of positional refinement using data from 8.0 - 2.6 A. me dropped significantly from 40 2% 
•o 31.2 ... THe model was further positionally refined using a full weight W A on the crystallograohic term The data 
included in the refinement was gradually extended to Z2 A. At 2.4 A resolution individual B-factors were refined and 
the attribution checked as a function of atom location (/.e. low B-fac,ors in the core and high B-factors on the surface) 
Cycles of refinement and refining allowed for 18 missing residues to be added. Essentially almost the complete cap 
domain was retrieved using the bootstrapping procedure, as shown in Figure 19. ., became apparent from the refined 
maps that the two streiches of missing ammo acids adopted a very different conformation in the two monomers (whh 
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* much as an average r.m.s.d. of 7.9 A for the Cs of residues 287 . 29^> For thi, „ 

regions had no, been relieved in the twc-fo.d averaging process Ue 2? J™ deC,ron densi * f « *m 
maps a.ong with averaging, mode, expansion and refinement is T"*™ " *» 

weakest data as well as an anisotropic scale factor to offset thTdecre" 7 * *»" '° ^ ,0% of 
The fina.mode.isof good « with a J ^ ^ 

Table 3). A Ramachandran plot is riven in Ficure 7 tT > be,WWn 8 0 and 2 2 A (»' 

(Read, ,986). The average ph* Zc b ^een ^e ^aH * ^ " """'^ * ^ 

1 0 .ode, is ca.cu.ated to b^H for data Te"weenTo - 22 A ^ * <* ^ 

««, Three - J,™ 11^^"' "T - ^ " 

pPPCA and lite Hydrolase Famil, >. The fold of oPPCA . 
enzymes such as the serine carboxypepridases dehll & , *' fM con ' ai " in 6 

(-^.^UavingvariousdifTeerr^ 
^-obo^^^^ 

"we.l as size (Figure 7A-F). pPPCA has one of the .a^est cap dom 17" ^ rapeC "° f °' d 

30 

-.er^^T^ -overa„f oI dof,epPPCA 

C« atoms and 3 8% seance identity. De^ng ^^^7 " ^ ^ «* 1 7 * for 302 
35 superimpose wi,h an r.m.s. deviation of 12 A onto CPW and CPY , ^ PPPCA ,0 

C™pPPCAand 2 7^.va,e„t^^^ 

The cap doma.n in pPPCA differs significantly from the CPW and ppv 
-a, a Urge maturation subdomain no, present in the struck o f C W ^7^7^ ^ 
enzymatically active forms are known All three enrv m ■ Wh ' Ch ,he structur " °f the 
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Superposition of the core domain of one monomer from each dimer show, tha. the second pair of monomers (form™ 
•He respecve dime*) differ by a remark, ,5- in orientation (Fi gU re 8B). Thus, it appears ha 1^ 
deferences ,„ the cap domains ,ead to a different arrangement of the subunits h th< dimers cTpPCA and CPW 

Carafytic Triad and Enzymatic Mechanism. Our stn.ct.re show, mat the precursor PPCA has all the elements 
IT H enZymatiC maChinery ° f ^ !erinC <«»™»~ ^ - - Hm, ,fr,. End -I 3 

1,50 r IT " aC,,Ve Site ° f PPPCA " ^ ^ Sef 1 50 ' »« «9 and Asp 372 The 0 r 

of Sr. 50 forms a good hydrogen bond with the N'l of His 429 with aNtoO distance of 2.8 A TheN'l of 429 

is 27 A removed fr0m ^ and 33 A from the u '' of Asp 372. Further, two backbone amides appear to orient the 
10 car^e group of Asp 372. The N of Ala 374 is at a distance of 3.0 A to the 0" of Asp 372 and! N of Cys 375 
at 3 distance of 2.9 A to the 0" of Asp 372. y 
The oxyanion hole proposed to stabilize the negatively chareed terrah^i 

snrxu in irw and CPY, in contrast to the planar orientation found in subtilisin- and t™«,„ ~ 
proteases (McPhalen et aL Biochemistry 27:6582-6598 (1988)). "btitam-^nd trypsm-tvpe senne 
20 In pPPCA, a pair of glutamic acid residues (Giu 69 and Glu 1 49) is no^itinn^ u , • . 

opposm to each other, may become depro.ona.ed resulting in unfavorable electron J 
- ending pane, or res. m « P™™^ 

Urates a, high pH. ■„ pPPCA the orientation of «h« pair of g ,uu,mic acids as we ^7.1 L ^ 
^de,ermined,pH 8 .TheCPWa„dCPVs,ruc, U resha V ebee n deteJ n : a ^ 

structure appears to mle out large pH induced conformational chanttes of these three r s d l - " 

13 -itsabovethat optimal for carbo.ypeptidase activi^. Howeveuh htgh .^Z 

does indicate some ro.e in a characteristic shared bv al. three enzymes ° f "~ °f — residues 
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pi' vJ^^fT"**"* PPCA ' Prcfere " Ce hydr0ph0biC r " id - in th < « "d/or 

P b.nd.ng pockets (Jaekman ,«£. W>Won 2/:925.928 (1993)). In CPW the PI' pocket was idemif.ed ,o consist 

C lu 398) a. ,hc en (L,ao e, a, ( 1 992). ,„/„). This explains the highest preference of this enzyme f or Arg and Lys a 

^ITTtT'- CarisbergRes - Commm - * :297 - 3 ' 1 (,987 »- ,n CPY a poet, 

formed by the res.dues Thr 60, Tyr256, Leu 272 and Me. 398 (Endrizzi e, al (1994) .„ PPr a L , 

— 1> » and Asp 64, forming *e sides of the pocket with „ ,e f ar end ^43^ 3^ 

reasonably conststent with an overall preference of PPCA for a hydrophobic leaving group 

Nation Mechan^mcf the Precunor Form. During the maturation step of the PPCA precursor form 
10 max.mum 

and/o Arg 298. The res.dues forming the 'excision' peptide adopt distinctly different conformations in 
crystallograph.ca.ly distinct monomers forming the PPCA dimer in our crystal structure Ye, in bo. 2 T 

F.gure 9). Arg 284 and Arg 292 are part.cularly well exposed. The main chain atoms of Arg 298 are less accessible 
bom sandwehed between the srrand M P 2 and a loop N-termina. to helix C«6. while a sal, bridge with G 2 4 
.he side chain atoms of Arg 298 partially solvent inaccessible. ^ 

PPCA T^rr * V " bl<>Clted ^ rMidUeS fr ° m matUra,i ° n SUbd0main in PT form of 

PPCA. The cataly„c mad .s rendered solvent inaccessible by residues Asn 275 He ^76 and Phe 577 -n. T 

20 are par, of the polypeptide Asp 272-Phe 277 which we cal, the Peking p^ 

Leu 180. Leu ,90. VaM 9, . Leu 232. V„ 235. He 246, Leu 280. Leu 282. Me, 299 and A.a 373 (F T 10 'add V 

25 play a central ro,e in the strand b.ocking the active sit, A glycine at this position appears cri^ro lowt 
po.ypept,de^ 

'*ngtheblockmgpep,^^ ThePl binding , 

poss.ble m the precursor form due to the inaccessibility of the substrate binding pockets 

We conclude that the inactivation mechanism of PPCA is based on blocking of the active site and no, „„„ 
c^ges ,n the position of Actional groups involved in cata.ysisWition state stabLtio, Both P Tp2 d' 

c I- r ar ,r: red soivent inacces$ib,e - ^ func,ion ° f ,he *** ^ -» *> * * * 

-lync t nad as well as the region around the P. and P2 binding pockets solvent inaccessible. The b.ocking pe il 
however, does not assume a conformation thai a peptide substrate would adoo, It i, «r,fi.n ■ 8 pcphde - 

35 

-correctly poised for ^ 

cWl Hence, mere removal of ,he excision peptide a,o„e is not sufficient to a„ow so.ven, or substrate access Z 
40 of PPCA appears to be My b.ocked by 49 residues of the maturation subdomai, as shown in Figure 1 1 Z7oZ 

IZTzi T e and ,he comparison with cpw and CPY fc is proposed - * 

res, ues 254-284 rearranges to free the P.. P2 binding si.es. while ,he residues 299-302 rearrange to free the Pr h H 
Arg 298. The extent of the residues rearranging « Hke.y to be limited by a disulfide bridge Cys 253 and Cys 3otwh!l 
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is conserved in the serine carboxypeptidase family. This critical disulfide serves to keep the secondary structure 
elements together at the far end of the PI' pocket. 

An interesting pair of salt bridges is observed between Arg 262, Asp 300, Glu 264 and Arg 298, four residues 
located on strands Mpl and Mf3 of the mixed p-sheet found in the maturation subdomain. This cluster of residues is 
5 strategically positioned at the base of the excision peptide, close the core domain and 'shielding' the mixed P-sheet via 
side chain interactions (see Figure 11). These residues are strictly conserved among the human, mouse and chicken 
PPCAs (Gaijart etai (1991), infra). This charge cluster may be effected by a shift from neutral to acidic pH. Arrival 
in the endosome/lysosome is expected to result in protonation of either the Asp or the Glu residue or both, resulting in 
unfavorable electrostatic interactions and destabilization of this charge cluster. This in turn is expected to promote partial 

10 unfolding of maturation subdomain, allowing easier access to additional potential cleavage sites, and stimulating removal 
of the 'blocking' peptide which fills the active site in the precursor. 

A similar double salt bridge has been observed in the aspanic proteinase zymogen pepsinogen between the 
proenzyme segment (Arg 8P) and the enzyme (Arg 308, Glu 13, Asp 304). 

The maturation mechanism for pPPCA appears to be novel among proteases for which the three-dimensional 

15 structure of the zymogen is known. .The catalytic triad in the precursor form is in a catalytically competent 
conformation. Enzymatic activity is prevented by a 'blocking* peptide. The blocking peptide is however different from 
the excision peptide and does not get excised from the mature enzyme. This leads to the distinct difference with the 
other known maturation mechanisms in that, after disappearance of the excision peptide, up to 35 residues filling the 
active site cleft in the PPCA precursor must rearrange to render the catalytic triad solvent accessible (see Figure 12), but 

20 do not get cleaved off. Removal of the excision peptide, and possibly a shift to lower pH in the endosome/lysosome, 
appears to be a trigger for this event. The mechanism does not appear to be autocatalytic, as uptake experiments with 
cultured galactosialidosis fibroblasts, have shown that a mutant PPCA with the catalytic Ser 150 mutated to Ala, is 
properly targeted and processed. It retains its protective function and except for the loss of catalytic activity is 
biochemically indistinguishable from the wild type enzyme (Gaijart et ai (1991), infra). Surprisingly, the maturation 

25 mechanism of the serine carboxypeptidases PPCA, CPW and CPY may all differ from each other as well. This is 
clearest for CPY, in which a 91 residue polypeptide is cleaved ofTN-terminally to convert the zymogen to an active 
enzyme (Wimher and Sorensen, Proc. Natl. Acad. Set. USA 55:9330-9334 (1991)), as opposed to the excision of a 
peptide from within the zymogen generating a two chain active form as is the case for PPCA and CPW. 

Looking at the hydrolase fold family, the catalytic triad is housed in the core domain and the various cap 

30 domains attenuate the biological function by influencing entirely different properties such as: (I) enzyme kinetics 
exemplified by the interfacial activation of lipases (Smith etai, Curr. Opinion in Structural Biology 2:490-496 (1992)); 
(ii) substrate channeling as is proposed for acetylcholine esterase (Sussrnan et ai (1991), infra); (iii) substrate 
recognition, proposed for dehalogenase by (Franken et al. (1991), infra) and for CPY and CPW by (Endrizzi et ai 
(1 994), infra): and (iv) enzyme inactivation in the case of PPCA. 

35 Biological Implications. Deficiency of the protective protein/cathepsin A (PPCA) in humans results in the 

lysosomal storage disease galactosialidosis. PPCA is thought to form a multi-enzyme complex with P-galactosidase and 
neuraminidase in the iysosomes protecting the latter glycosidases in their harsh acidic and proteases-rich environment. 
PPCA has a 30% sequence identity to the wheat serine carboxypeptidase (CPW) and yeast serine carboxypeptidase 
(CPY). It has been show that PPCA in the precursor form is inactive, but upon maturation, entailing excision of a 2 kDa 

40 peptide, carboxypeptidase activity is released. 

The precursor structure reveals an inactivation mechanism that has not been seen before in any of the other 
known zymogen structures of proteases (available for the serine-, metallo- and aspartic protease classes). The catalytic 
triad seems to have an arrangement poised for catalysis. However, the triad is rendered solvent and substrate 
inaccessible by a strand from the maturation subdomain binding in the active site cleft. Surprisingly, this strand called 

45 the 'blocking' peptide does not overlap with the 2 kDa 'excision* peptide. Hence, after removal of the excision peptide 
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up to 35 additional residues must rearrange in order to unblock the active S ir e ^ ft . 

Sa " "*■»■ Ar g 262. Arg 298. G.u 264. and Asp 300 at the bal the ** * 

and/or promoting tf,e rearrangement even,. ""^"g eas.er access to cleavage si , es 

™re to be determined, mav prove usefu, in reture ^Js of ZtZT™ *' * W ^ ,ySOSOrna, 
Ration from the three-dimensional structure or PPCA m h ^ ^ "* **^'*o- 

with increased stability and a longer half-iifc. S ' 8n ' ng *" «"6ineered form of PPCA 
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5 



resolution 


32.27-2.2 A 


wavelength 


1.08 A 


space group 


P2.2.2 


unit cell 


a=l 15.04 b =148.1 1 c=80.97 A 


temperature of data collection 


-178°C 


No. of observed reflections 


436,709 


No. of unique reflections 


67,740 


completeness of all data 


95.7% 


R,^ for all data 


5.1% 


completeness of outer shell 


87.0% 


(2.26-2.20A) 


13.0% 


R,^ in outer shell (2.26-2.20A) 




^ym-IEUnJ-^WVJX Jj(h), where I,(h) is the i* observation for reflection h 
and <l(h)> is the weighted mean of all the observations. 
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Model 



nr. f 


III . 1 5iQC 




Rfact r 




CC 


1 cc, m 


C-s 


chains 


(io 4 A') 


{statistics using data between 8.0 and 3.oAl 


331 


125 


- 


54.2 
52.6 


55.3 
52.9 


0:243 

0287 


0.244 
0.318 


294 


228 


• 


55.9 
53.5 


57.4 
55.0 


0.228 
0.320 


v£ 1 O 

0328 


j / J 


258 


10.8 


49.9 


51.3 


0.403 


0.424 


405 


277 


10.8 


48.6 


48.4 


0.443 


0.478 


411 


307 


9.99 


47.1 
46.9 
39.4 


48.6 
48.4 
44.7 


0.471 
0.476 
0.622 


0.491 
0.492 
0.562 


412 


327 


10.8 


41.7 


43.1 


0.584 


0.585 


435 


387 


8.88 


39.8 


40.6 


0.621 


0.623 


442 


413 


9.11 


38.4 


40.2 


0.647 


0.637 



mot, renl fmr) 



rigid body ref. (rmr) 
calculate NCS matrix 

best monomer ffrm) 



rigid body ref. 
update NCS matrix 

lHQ£l(mask 1) 

km£2(mask 1) 

km£l(mask 2) 
rigid body ref. 
positional ref. (pbmc3) 
update NCS matrix 

imcl(mask 1) 

bmc5 (mask 3) 

kO£fi(mask 4) 



the modular rep.ac^^ 
the bootstrapping cycle, bmel through bmc6. ■£ fcEXKStoS t?« ^LT"*? " d 
the number of C atoms built per monomer the number ofc^r^S! v. 8 °* Ae vsnous modds: 
and the volume of the moieX maslS Z^T t £S?u " ?T m i°n»™ ted ^ mOT0 ™ 
is assessed using the R,_ C C £55, ^^J! *> op^'''™' qUa,i,y of each mode > 
After position., "f.ne^mode^ 

and generating the non-crystallographically related one. equivalent by taking one monomer 



WO 97/15588 PCT/US96/17325 

-33- 

Table 3: Current Status of the Model 

statistics for the data UM< j j n rpfin^mffnr 



resolution (A) 


Rfactor (%) 


completeness (%) 


8.0-4.3 


22.4 


85.7 


4.3-3.5 


19.0 


89.1 


3.5-3.0 


20.6 


89.1 


3.0-2.8 


21.3 


87.9 


2.8-2.6 


22.3 


86.1 


2.6 - 2.4 


22.2 


84.0 


2.4 . 2.3 


22.7 


81.3 


2.3-2.2 


24.0 


78.3 


8.0 - 2.2. A 


21.3% 





model : 

molecules in the asymmetric unit: 2 
residues (out of 904 possible): ^ 
sugars; ^ 
waters: 

296 



r.m.s.d. bond length (A): 
r.m.s.d. bond angles (•): 



0.012 
1.72 



average B-values for main chain atoms (A 2 ): 
side chain atoms (A 2 ): 



16.6 
18.3 
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Table 4 



10 



15 



20 



25 



30 



His 429 



Asp 372 



N 


Ser 146 


N 


c* 




C" 


c 


His 397 


C 


0 




0 


rA 
C* 




0 


0 T 


Asp 338 


0* 


N 




N 


c- 




C* 


C 




c 


0 




0 


c* 




c* 


o* 






c« 




d 




s 


N* 1 
No 




Nu 


N 




N 


C 




c- 


c 




C 


o 




0 


a 




a 


o 




o 


0" 






0" 







His 397 



Asp 338 



0.4 
0.3 

0.7 
0.2 
0.1 
0.1 
OJ! 
0.3 
0.2 
0.2 
0.4 





aPPCA-CPW 


N , 


(A) 


C* 


0.4 


c 


0.5 


0 


04 


c 


0.4 




1.1 


N 


0.9 


L 


0.4 


C 


0.4 


0 


0.5 


c* 


0.6 


O' 


0.6 


C" 


0.5 


C" 


0.5 


N M 


0.5 




0.4 


N 


0.5 


c- 


0.2 


c 


0.1 


0 


0.1 


c 


0.1 


c 


0.2 


0*' 


0.1 




0.3 




0.1 




WO 97/15588 

_ 35 _ PCT/US96/17325 

Brunger et ai, J. Mol. Biol. 203:803-81 6 (1987) 
Briinger, A.T., Acta Cryst. A46:46-57 (1990) 

Briinger X-PLOR. Version 3.1, "A system for X-ray crystallography and NMR." Yale 
Umvers.ty Press, New Haven, CT (1992) PnyanaiNMR, Yale 

5 Chong et al. Biochim. Biophys. Acta 7077:65-7 1 ( 1 991 } 
Chong et al, Eur. J. Biochem. 207:335-343 (1992) 

Colligw « fl / ed s , current Protocols in Immunology, Greene Publishine Assoc and 
Wiley Interscience, N.Y., (1992, 1993) °"sning assoc. and 

Cowtan and Main, Aw Crystallogr. D49:l4$-\57 (1993) 
Cre.ghton, Catalysis in Proteins Structures and Molecular Principles WH 

Freeman and Company (1984), pp. 439-443 ' 
Crennell et ai., Structure 2:535-544 (Jun. 1994) 
d'Azzo et al Proc. Natl. Acad. Sci. U.S.A. 7P:4535-4539 (1982) 
Azzo et 'al, "Galactosialidosis," in The Metabolis and Molecular Bases of Inhere 

Engh and Huber, Acta Cryst. ^7:392-400(1991) 
Franken, S.M., et al., J. EMBO 70:1297-1302 (1991 ) 
Franken, S.M.. et ai, 1 EMBO 70:1297-1302 (1991) 
M Fujmagaand Read, J Appl. Cryst 20:517-512(1987) 
GMjanetal.J. Biol. Chem. 255:14754-14762 (1991) 
Goodford, J. Med. Chem. 25:849-85 7 ( 1 985) 
Guasch, A., et al.J. Mot. Biol. 22*141-157 (1992) 
Hanna et al., J. Immunol. 753:4663-4672 (1994) 

Harlow a,d Une. Antibodies: a Moratory Manual, Cold Spring Harbor Laboratory 
Henderson, Biochem. J. 727:321-333 (1972) 
Hoogevecn eta!.. J. Biol. Chem. 255:12143-12146(1983) 
Hubbes era/., J. Biochem. 285: 827-83 1 (1992) 
30 ltoh etal, J.Biol. Chem. 270:515-518(1995) 

Jackman etal.. J. Biol. Chem. 265:1 1265-1 1272 (1990) 
Jackman et al.. J. Biol. Chem. 257.2872-2875, (1992) 
Jackman etal. Hypertension 27:925-928 (1993) 
James and Sielecki, Nature 379.33-38 (1986) 
Jones, et al., Acta Crystallogr. A47: 1 1 0- 1 19 ( 1 99 1 ) 
to™*, etal. Acta Crystallogr. ^7:753-770(1991) 
Kabsch and Sander, Biopolymers 22:2577-2637 (1983) 
Kabscn, J. Appl. Crystallogr. 25:795-800 (1993) 
Kase Biochem. Biophys. Bes. Commun. 772:1175-1179 (19901 

meaicme, CRC Press, Inc., Boca Raton ( 1 995) 
Kley^egt & Jones, Bailey et al. eds.. First Map to Final Model SERC D ar «h 

Laboratory, UK, pp 59-66 ( 1 994) ' C Daresbu ry 

Kohler and Milstein. Atawe 23(5:495-497 (1975) 
« Kraul.s, J. Appl. Cryst. 2*946-950 (1991) 

Lackowski et al.. J. Appl. Cryst. 25:283-291 (1993) 
Lamzin and Wilson, Acta Cryst. ZW.129-147 (1993) 
Liver, Virology 55:78-87 (1978) 
Leatherbarrow, Trends Biochem. Sci. 73:455-458 (1990) 
Liao et al.. Biochemistry 37:9796-9812 (1992) 
LUthy et al., Nature 355:83-85 ( 1 992) 
Luthy et al. , Nature 355: 83-85 ( 1 992) 
Matthews, B.W., J. Mol. Biol. 33:491-497 (1968) 
McPhalen and James et a!.. Biochemistry 27:6582-6598 (1988) 



35 



40 



50 



WO 97/15588 PCT/US96/17325 

•36- 

Metcalf & Fusek, EMBO 72:1293-1302 (1993) 

Molecular Replacement. "Proceedings of the CCP4 Study Weekend " Machin (1985) 

Molecular Replacement. "Proceedings of the CCP4 Study Weekend," Dodson et ai 
(1992) ** 
5 Murti et al , Proc. Natl Acad. Sci. USA 90: 1 523- 1 525 ( 1 993) 

Musil et ai, EMBO 70:2321-2330 (1991) 

Nicholls, A., et at.. Proteins 77:281-296 (1991) 

Noble et al, FEBS Lett. 357:123-128 (1993) 

Oho. B-H, Acta Cryst. 7)57:140-144 (1995) 
10 Okamura-Oho t Y. et al, Biochim. Biophys. Acta 7225:244-254(1994) 

Ollis et ai, Protein Eng. 5:197-21 1 (1992) 

Potier*/ al,Analyt. Biochem. 94:287-296 (1979) 

Potier et ai. 1 Biochem. 2(57:197-202 (1990) 

Pshezhetsky et ai, Biochem. Biophys. Acta 1 122: 154-160(1 992) 
1 5 Pshezhetsky et ai, Biochemistry 34:243 1 -2440 ( 1 995) 

Rao, et ai, Acta Cryst. A36; 878-884 (1980) 

Rawlings & Barrett, Methods in Enzymology. 24*19-61 (1994) 

Read, R.J., Acta Crystallogr. A 42:140-149 (1986) 

Rossmann, "Improving Protein Phases" Proceedings of the CCP4 Study Weekend. (Feb 5-6 1988) 
20 Rubanyi and Polokoff, Pharmc.Rev. 45:325-415 (1994) ' 
Rudenko et ai , Structure 3: 1 249- 1259(1 995). 

Sambrook et ai, Molecular Cloning: A Laboratory Manual, Second edition, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY ( 1 989) 

25 LtoWUm) ^ W * dmd * PP * ^'^ SERC 

Scheibe et ai, Biomed. Biochim, Acta 49:547-556 (1990) 

Scriver, C.R. et ai, eds., The Metabolic and Molecular Bases of Inherited Disease 
Vol 1 1, 7th Ed., McGraw Hill Inc., p. 2825-2837 (1994) 

Smith et ai, Curr. Opinion in Structural Biology 2:490-496 (1992) 
30 Steigeman, W., PhD Thesis, Technical University, Munich, Germany 

Sussman et ai, Science 253: 872-879 (1991) 

Takimoto et ai y J. Virol. 55:7597-7600 (1992) 

Taylor et ai, 1 Moi Biol. 226:1287-1290 (1992) 

Tete-Favier, et ai. Acta Cryst. 049:246-256 ( 1 993) 
35 Tertamami et ai eds., Sialidases and Sialidosis. Perspectives in Inherited Metabolic 
Diseases, Vol. 4, Edi. Ernies, Milano, pp. 261-279 and 379-395 (1981) 

Thompson et ai, J. Virol 52:4653-4660 ( 1 988) 

Tronrud et ai,Acta Crystallogr. A 43:489-501 (1987) 

Varghese et al, Proteins 74:327-332 (1992) 
40 Verheijene/a7„ Biochem. Biophys. Res. Commun. 705:868-875 (1982) 

Vnend, J. Moi Graph. 5:52-56 ( 1 990) 

van Diggelen et al. J. Biochem. 200: 143-151 (1981) 

van Diggelen et al, Biochem. Biophys. Acta. 703:69-76 (1982) 

van Diggelen et. ai Lancet 2:804(1 987) 
45 Wahi et ai, J. Nucl Med. 24:3 1 6-325 ( 1 983) 

Wenger et al, Biochem. Biophys. Res. Commun 52:589-595 (1978) 

Winther and Sorensen, Proc. Natl. Acad. ScL 55:9330-9334 (1 99!) 

Wolf et ai, eds., Isomorphous Replacement and Anomalous Scattering: Proceedings 
<n v °f CCP4 Stud y Weekend, pp. 80-86, SERC Daresbury Lab., UK (1991) 

Yamamoto & Nishimura, J. Biochem. 79:435-442 (1987) 

Yamamoto etal. J. Biochem. 92:13-21 (1982) 

Zhou et ai, J. EMBO. 70:404-4048 (1991 ) 



WO 97/15588 



-37- 



PCT/US96/17325 



What Is Claimed Is: 

1. A method for crystallizing a human protective protein/cathepsin A (PPCA) or 
precursor human protective/cathepsin A protein (pPPCA). comprising 
(a) providing a purified PPCA or pPPCA; 
5 (b) crystallizing the purified PPCA or pPPCA using a hanging drop or diffusion 

method, to provide crystallized PPCA or pPPCA having biological activity, 

wherein the crystallized PPCA or pPPCA is resolvable using x-ray crystallography to obtain 
x-ray diffraction patterns suitable for three-dimensional structure determination of the PPCA or 
pPPCA. 

10 2 " A method according to claim 1, wherein said PPCA or pPPCA has at least one 

biological activity selected from the group consisting of enzyme protecting activity, enzyme 
modulating activity and peptide hydrolyzing activity. 

3. A method according to claim 1, wherein said crystallization step is done under 
conditions of purified PPCA or pPPCA; 2-30% PEG400-1 0,000; precipitating salt; buffers and pH 

15 7-9. 

4. A method according to claim 3, wherein the crystallization conditions are PPCA or 
pPPCA; 5-14% PEG8000, 40-80 mM tromethamine, 0.05-2.0 mM NaN, and pH 8.0-8.3. 

5. A crystallized PPCA or pPPCA, or at least one subdomain thereof, provided by a 
method according to claim 1. 

20 6- A method for providing an atomic model of a PPCA or pPPCA, comprising 

(a) providing a computer readable medium having stored thereon atomic 
coordinate/x-ray diffraction data of said PPCA or pPPCA in crystalline form, said data sufficient to 
model the three-dimensional structure of said PPCA, said pPPCA, or at least one subdomain thereof; 

(b) analyzing, on a computer using at least one subroutine executed in said computer! 
the atomic coordinate/x-ray diffraction data from (a) to provide data output defining an atomic model' 
of sa.d PPCA or said pPPCA, said analyzing utilizing at least one computing algorithm selected from 
the group consisting of data processing and reduction, auto-indexing, intensity scaling, intensity 
merging, amplitude conversion, truncation, molecular replacement, molecular alignment, molecular 
refinement, electron density map calculation, electron density modification, electron map 
visualization, model building, rigid body refinement, positional refinement; and 

(c) obtaining atomic model output data defining the three-dimensional structure of 
said PPCA, pPPCA or at least one subdomain thereof. 

7. A method according to claim 6, wherein said computer readable medium further has 
stored thereon data corresponding to a nucleic acid sequence or an amino acid sequence data 
comprising at least one structural domain or a functional domain of a PPCA or pPPCA 
corresponding to a portion of the amino acid sequences of Figures 13 or 14, and wherein said 
analyzing step further comprises analyzing said sequence data. 
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or pPPCA as / T ^ ^ ^ -«W data of said PPCA 

or pPPCA as the model output data produced by a method according tQ claim fi PPCA 

* A computer-based system for providing atomir a . » » 

(a) a computer readable medium having stored tW«n • 

(b) at least one computing subroutine, that when pv^»^ • 

computer to anaJyze the atomic coordinate/x-.y ^TZ7^Z \ T ** 
defining an atomic model of said PPCA or pPPCA said T °° ^ ° Utput 

10 —selected from me 

intensity scaling, intensity merging , ^ c conversTn Tn " ' ****** 

-o.ecula, alignment, molecular refinement electron I i 

Edification, electron map ^^^^ ZT^ *** *"* 

refinement; and g ' ngld ^ ^ment, positional 

15 ^ retrieva I means for obtainine atomir mn^»i „ 

A computer readable medium, havinc stored t ♦ • 

12 A-^r ... H uuceQ oy a computer system according to claim 9 

(a) providing a computer readable medium accord,. . . • 

PPPCA «, ,„ provi<ie ^ outpu , J* ■ ' <— » S «. of PPCA „ 

a. <«. or* sub domair. ,w, said JL, U IZ ^ "* PPCA ' pPPCA - « 

-p * «, P _;„ g Jr:r~: e r-i:: s s r from ,h < 

refine™,,, elect™ denary map calculator. elZ, f '""'"'''S™™' ™lecular 
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(d) obtaining atomic model output data defining the three-dimensional structure of 
a hgand of said PPCA, pPPCA or at least one subdomain thereof. 

13. A computer readable medium having stored thereon the model output data produced 

by a method according to claim 12. 

5 14. An isolated PPCA or pPPCA ligand, corresponding to the physical molecule of the 

atormc model of the ligand model produced by a method according to claim 12. 

15. A computer-based system for providing an atomic model of a ligand of a PPCA or 
prrLA, comprising the following elements; 

(a) acomputerreadablemediumhavingstoredthereonatomicmodeldataofaPPCA 

[) or pPPCA; 

(b) a computer readable medium having stored thereon atomic model data sufficient 
to generate atomic models of potential ligands of PPCA or pPPCA; 

. t fpprA (C) n ^ t0nCCOm P^ 

data of PPCA or pPPCA from (a) and the ligand data from (b). to determine binding sites of PPCA 
ZZl -7 1 Pr ° VidC ^ ^ dCfinin8 2 a, ° miC m0ddS ° f P° tential ^ PPCA or 

cl^io r n °z iD r d *** ~ 

onv^on, truncal, molecular replacement, molecular alignment, molecular refinement, electron 
dens,* map calculate, electron density modification, electron map visualization, model building 
rigid body refinement, positional refinement; and 

mod , f ^ , r iCVal ^ ° btaining at0miC m0del ° Ut P Ut data defi ™S 'he atomic 
models of potential hgands of PPCA or pPPCA. 

Ho „ 'Ln ACOmPUtCrreadaWe ^^^ 

hgandofPPCAorpPPCA.saiddataproduc^byamethodaccordingtoclaim 15 
atomic! , ^; SO T' CAOr ^^ 

atomic model of a ligand produced by a computer system according to claim 1 5 
18. Acrystalli 2 edpPPCA,r*vmg*e^ 
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FIG. 6A 
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FIG.10B 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international report has not been established in respect of certain claims under Article I7(2)(a) for the following reasons: 

1. |~| Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 




Claims Not.: 

because they relate to parts of the international application that da not comply with the prescribed requirements to such 



.an extent that no meaningful international search can be carried out, specifically: 



3. [ J Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Obserrations where unity of intention b lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1 . j~| As ail required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3 . | | As only some of the required additional search fees were timely paid by the applicant, this international jearch report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. No required additional search fees were timely paid by the applicant. Consequently, this international search report is 

restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-4 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

ThU application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13. 1. In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group I, claims 1-4, sharing the inventive concept of methods of crystallizing PPCA or pPPCA. 
Group II, claims 5 and 18, sharing the inventive concept of PPCA protein. 

Group III, claims 6-11, sharing the inventive concept of a method of providing an atomic model of PPCA. 

Group IV, claims 12-13 and 15-16, sharing the inventive concept of a method of providing an atomic model of a ligand 
for PPCA. 

Group V, claim* 14 and 17, sharing the inventive concept of a ligand for PPCA. 

The inventions listed as Groups 1-V do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons: The claims 
are not so linked by a i pec is] technical feature within the meaning of PCT Rule 13.2 so as to form a single inventive 
concept. The * special technical features" means those technical features that define a contribution over the prior art. 
(See PCT Rule 13.2.) Because PPCA was known in the prior art (see description at page 1, lines 16-18). it cannot 
form the basis of unity of invention. Therefore, the main invention which forms a single inventive concept is Group I, 
claims 1-4, which is a method of crystallizing. Group II has the inventive concept of a PPCA protein, Group III has 
the inventive concept of a method of providing an atomic model of PPCA, Group IV had the inventive concept of 
providing an atomic model of a ligand for PPCA and Group V has the inventive concept of a ligand for PPCA; none of 
these Groups share the special technical of Group I therefore, unity of invention is lacking. 
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